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Poisson, see Poisson 
power series, 570, 587 
Weibull, 503 
Dose response curve, single-hit, 184 
Double-blind, 405 
Ecology, 175, and see entomology, fish, 
genetics, population, sampling 
Economics, 618, and see minimum cost, 
programming, risk 
Efficiency, 254, 481, 530, 627, and see 
estimation, maximum likelihood, 
sufficient statistic 
asymptotic, 63, 573 
of estimation, 560, 562 
Entomology, 188, 349, 501, 585, 658, 
and see toxicology 
Epidemiology, 170, 172 
Ergodic chain, 169 
Errata, 333, 509, 581, 669 
Error, 
correlated, 297 
gross, 182 
interblock, 577, and see information 
intrablock, 577 
mean square, see variance 
rate, 326, 539 
regression, see regression 
term, proper, 144 
theory of, see analysis of variance, 
least squares, maximum likelihood, 
model, rejection of data 
type I, 324 
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type II, 324 
type III, 324 
Estimation, 18, 277, 284, 372, 571, and 
see bias, efficiency, fitting con- 
stants, gene frequency, information, 
least squares, maximum likelihood, 
method of moments, population, 
score, sufficient statistic 
efficiency of, 26 
nonparametric, 58, 657 
of distribution, 530 
of frequency, 29 
of mean, 26 
of optimum, 492 
probability, 18 
of ratio, 590 
of variance components, 438 
optimum, 48 
unbiased, 267 
Expectation, mathematical, 610, and 
see mean, moments 
conditional, 389 
Experimental design, see design of 
experiments 
Experimental units, see selection 
used as own controls, 408 
Extinction curve, 125 
Extinction time, 160 
Extreme, 160 
Factor analysis, 331, and see com- 
munality, multivariate analysis 
Factorial experiment, 171, 181, 319, and 
see bioassay, fractional replication 
asymmetrical, 176 
Feedback, 139 
Fiducial limits, see confidence limits 
Field experiments, 167, and see agron- 
omy, analysis of groups of experi- 
ments, design of experiments 
Fieller, 643 


Fish, 167, 415, and see ecology, popula- - 


tion management 
scales, 167 
Fitness, 175, 388 
Fitting constants, 1, and see least 
squares 
Fitting distribution, see estimation 
Fitting regression line, see covariance, 
least squares, regression 
both variables subject to error, 283 
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Follow-up studies, missing data, 57, 653 
Fourfold table, 173, 176, and see chi 
square, contingency 
Game, 167 
matrix, 167 
Gamma distribution, 51 
Gene frequency, estimation of, 153 
Generating function, 431, 587, and see 
moments, probability 
Genetic combining ability, general, 230 
Genetic correlation, 359, 474, and see 
genetic variance, heritability, path 
coefficients. repeatability 
Genetic covariance, matrix, 503, and 
see genetic correlation, genetic 
variance 
Genetic model, see model 
Genetics, 169, 261, 329, 387, and see 
animal breeding, breeding value, 
chromosome, diallel cross, fitness, 
gene frequency, genetic, heritability, 
inbreeding, linkage, outbreeding, 
repeatability, weighting 
population, 168, 658 
Genetic selection, 
effect of linkage, 506 
index, 501, 502 
Genetic variance, 174, 475, 483 
components, 233, 445 
Geometric series, 584 
Gompertz curve, 89 
Goodness of fit, 253, 261, 507, 584, and 
see chi square 
Gradient, 48 
Graeco-latin square, 406 
Graphics, 86, 95, 178, 495, 649, 658, 
and see computation 
Grouping, 26, and see qualitative data 
Growth, 89, 170, 174, 220, 332, 333, 
349, 634, 653, and see allometry, 
biometry, logistic curve, population 
experiments, 506 
rate, 352 
Half-life, 127, and see three-quarters- 
life 
Half-replicate design, 500 
Heredity, see genetics 
Heritability, 173, 364, 553, 626, and see 
genetic correlation, repeatability 
estimation of, 481 
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Heteroscedasticity, 167 
Horticulture, 97, 333, 634, 
agronomy 
Hotelling’s T, 167 
Hyperbolic distribution, truncated, 278 
Hypergeometric distribution, 53, 495, 
569, 620 
Hypothesis, 635, and see law, model, 
parsimony, test 
test of, 372 
Identifiability, 
over-, 142 
problem of, 140 
under-, 142 
Inbreeding, 232, 445, 
breeding 
Incomplete blocks, 176, 406, 646, and 
see lattice, randomized blocks 
balanced, 111 
efficiency of, 113 
Infection model, 168 
Information. 26, 120, 155, 554, 591, and 
see analysis of variance, design of 
experiments, estimation, maximum 
likelihood 
interblock. 459, 648 
intrablock, 459 
loss of, 117 
by truncation, 155 
matrix, 91, 530, 573 
empirical, 93 
Integral equations, 351 
Interaction, see analysis of variance, 
model 
as error term, 145 
assumed zero, 145 
confounded, 319 
genotype-environment, 168, 387, 475 
Tsobole, 187, 658 
Iteration, 2, 93, 154, 375, 590, and see 
computation, least squares, maxi- 
mum likelihood 
Jacobian, 399 
Judging, see organolepsis 
Kurtosis, 528 
Lagrange multipliers, 637 
Latin square, 405 
cyclic, 407 
Lattice, balanced, augmented, 448 
Law, see hypothesis, model 


and see 


and see out- 
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inverse square, 125 
inverse square-root, 125 
inverse two-thirds, 125 
single-process, 125 
Least squares, see analysis of variance, 
estimation, fitting constants, itera- 
tion, matrix, normal equations 
almost, 167 
equations, 91 
estimate, 291. 463 
Life, expectation of, 64 
Life table, 57 
construction, 74 
Likelihood, 62, 157, and see maximum 
likelihood 
Linkage, 506 
Logistic curve, see growth, logit trans- 
formation, population 
fitting, 89 
generalized, 89 
Main effect, 372, and see analysis of 
variance, interaction 
Mantoux test, 33 
Matching, see balancing, paired com- 
parisons 
Matrix, 169, and see characteristic roots, 
covariance matrix, information ma- 
trix, multivariate analysis 
augmented, 397 
classification, 139 
diagonal, 374 
dispersion, 12, 95 
inversion, 95, 248 
rank of, 112, 394 
Maximum likelihood, 1, 58, 121, 142, 
NGB, WS, Ba ih, LOR Beh byte 
584, 658, and see efficiency, esti- 
mation, information 
bias, 121, 554 
Mean, see moments 
adjusted, see least squares 
harmonic, 491 
of product, 172 
Measurement, 176, 177 
Medicine, see analgesia, anesthesia, 
bacteriology, chemotherapy, clini- 
cal, diagnosis, genetics, morbidity, 
mortality, pharmacology, physiol- 
ogy, therapeutic index, toxicology 
Metameter, 180, 657, and see scale 
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Migration, 508, 571 
Minimum cost, 12 
Missing data, see fitting constants, 
follow-up studies 
Mitscherlich’s law, 492 
Model, see biometry, causation, com- 
ponents of variance, feedback, 
hypothesis, path coefficients, par- 
simony, regression, structural, trans- 
formation 
analysis of variance, mixed, 145, 607 
asymptotic, 121 
birth-death, 168 
departures from, 579 
diagnostic, 299 
exponential, 125 
formation of, 120 
genetic, 361, 481, 627 
growth, 332 
host-pathogen, 386 
least squares, 463, 577 
linear, 125, 660 
logistic, 372 
mathematical, 33 
organoleptic, 252 
parabolic, 125 
quantal response, 188 
rectangular hyperbola, 125 
regression, 580 
static, 177 
statistical, 33 
stochastic, 168, 177 
test of, 108 
Moments, see cumulants 
conditional, 598 
factorial, 536, 568, 598 
generating function, 589 
method of, 535, 571 
product, see covariance 
Monte Carlo, 501, 504, 506 
Morbidity, 57, 169 
duration, 172 
quantitation, 173 
Mortality, 349 
radiation, 508 
Multivariate analysis, 164, 165, and see 
analysis of variance, discriminant 
function, factor analysis, Hotelling’s 
T, matrix 
quantal, 657 
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Multinomial distribution, 30, 373 
Nervous system, 187, and see analgesia, 
anesthesia 
Newman-Keuls test, see multiple range 
test 
Newton-Raphson method, 128, 374, and 
see Newton’s method 
Newton’s method, 154, and see Newton- 
Raphson method 
Noncentrality, 501 
Normal distribution, multivariate, 12 
Normal equations, 211, 451, and see 
least squares, matrix 
nonlinear, 644 
Odor tests, see organolepsis 
Operating characteristic, 186 
Order, 10, and see test 
Order statistics, 26 
Organolepsis, 251, and see judging, 
score, subjective evaluation, triangle 
test 
Orthogonal parameters, 108 
Outbreeding, 174, and see inbreeding 
Pairing, see balancing, paired com- 
parisons 
Palatability, see organolepsis 
Parasitology, 617, and see pathology 
Parsimony, 124 
Path coefficients, 475, and see causation, 
correlation 
Path analysis, nonlinear, 120 
Path regression, nonlinear, 135 
Pathology, 165, 386, 617 
plant, 331 
Percentage, see proportion 
Perception, see organolepsis 
Pharmacology, 179, 180, 186, 188, 221, 
657, and see analgesia, anesthesia, 
bioassay, isobole, synergism, thera- 
peutic index, toxicology 
Physiology, 275, 332, and see blood 
pressure, medecine 
Placebo, 405 
Plotting procedure, see graphics 
Poisson distribution, 79, 156, 415, 501, 
527, 569, 586 
-Pascal, 527 
truncated. 571 
Pooling, 144 
Population, see distribution, fish, sta- 
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tistical genetics 
dynamics, 349, 386, 584 
genetics, see genetics 
management, 175, and see ecology 
stable, 387 
stationary, 393 
Poultry, 489, 506, 617 
Power, see test 
discriminating, 11 
Prediction, see regression 
Preference, see organolepsis 
Probability, 330, 495, 595 
conditional, 300, 388 
cumulative, 603 
generating function, 527, 567 
joint, 62, 595 
Programming, see automatic computa- 
tion 
linear, 505 
Proportion, see binomial 
analysis of, 1 
Provisional value, 2 
Pseudofactor, 176 


Psychology, 379, 406, and see or- 
ganolepsis 

Qualitative data, 10, and see con- 
tingency, grouping, quantal re- 


sponse, score 
replacing normal, 22 
Quality, see organolepsis 
Quality control, 495 
Radioactivity, 507, and see half-life, 
radiology, tracer 
protection from, 179 
Radiology, 499, 508 
Randomization, 448, 
selection 
Randomized blocks, see balancing, in- 
complete blocks, interblock error 
augmented, 448 
Random mating, 232, 388 
Random variable, 69 
Range, studentized, 326, 541 
Rank, 255, 657, and see matrix, score, 
tie, transformation 
sum test, 175, 539 
Ratio, see proportion 
estimator, 267 
of Poisson variates, 501 


and see _ bias, 


“ Recombination, 150 
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Rectangular distribution, 278 
Recurrence formula, 532, 570, 587 
Regression, see change-over point, 
change-over value, correlation, co- 
variance, fitting regression line, 
structural 
analysis, see analysis of covariance 
asymptotic, 321, 651 
generalized, 120 
chain, 139 
cyclic, 139 
estimate, 267 
heterogeneity of, 639 
model, 122, and see structural 
parent-offspring 
two-phase, 634 
Rejection area, see critical region 
Rejection of data, 182 
Repeatability, 505, 553, and see genetic 
correlation, heritability 
Replication, 283 
efficiency of, 54 
fractional, 319 
Residual, see error 
effect, 405 
Response, see bioassay, dose response, 
model, time response 
quantal, 
multivariate, 657 
semi-, 180, 181 
Reviews, 160, 329, 495, 657 
Risk, competing, 57 
Sample size needed, 11, 48, 151, 324, 
501, 504. 541, and see organolepsis 
Sample, small, 283 
Sample survey, 502, and see sampling 
Sampling, 165, 169, 175, 176, 415, 497, 
and see components of variance, 
design of experiments, sample size 
needed, sample survey, truncation 
cluster, 504 
cost of, 617 
ecological, 617 
experiment, see sampling studies of 
statistical problems 
multistage, 272 
optimum, 617 
random, 269 
sequential, see sequential 
stratified, 270 
studies of statistical problems, 131, 
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182, 359, 541, and see Monte Carlo 
survey, 497 
systematic, 416 
Scale, 173, 253, 372, and see metameter, 
score 
are tangent, 189 
parameter, 503 
subjective, 408 
Score, 305, and see discriminant func- 
tion, metameter, organolepsis, rank, 
scale 
critical, 317 
method of, 128 
Screening test, 177, 181, 185, 186, 447, 
658, and see bioassay 
Selection, see balancing, design of 
experiments, genetic selection, or- 
ganolepsis, randomization, rejection 
of data, sampling 
bias, 505 
of data, 409, 553, 649 
of error term, see proper error term 
of samples, 364 
Sensory test, see organolepsis 
Sequential experiment, 48, 657, and see 
bioassay, quantal response 
truncated, 504 
Serology, 33 
Sign test, 326, 650 
Skewness, 528 
Sociology, 13 
Spearman estimate, 79 
Spearman-Karber estimate, 500 
Split plots, 144, and see strip plots 
Standard deviation, see variance 
Standard error, see variance 
Standardization, 33, 186, 657 
Standard treatment, 111, 452 
Statistics texts and periodicals, see 
reviews 
Steepest ascent (descent), method of, 48 
Strip plots, 148, and see split plots 
Stochastic differential equation, 508 
Stochastic process, 57, 169, 425, and see 
random variable 
Stochastic variable, 137 
Structural equation, 139 
Structure, 107, 137 
Subjective evaluation, 408, and see 
organolepsis 
Successive approximation, see iteration 
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Sufficient statistic, 2, 121, 585, and see 
efficiency, estimation 
Survival curve, 173, and see dose 
response curve, mortality, time 
response curve 
Survival time, see time response curve 
Symmetrical complementation design, 
499 
Synergism, 188 
Tables, miscellaneous, 52, 55, 91, 92, 
100, 126, 244, 245, 247, 291, 321, 
324, 408, 495, 496, 501, 507, 533, 
534, 535, 544, 546, 561, 593, 600, 652 
Taste test, see organolepsis 
Taxonomy, 331 
Taylor series, 122 
Test, see analysis of variance, chi 
square, comparison, confidence 
limits critical region, goodness of 
fit, order, rank, rejection of data, 
sequential, sign test, triangle test, 
t test 
asymptotic, 142 
distribution-free, 182 
exact, 133 
conditional, 595 
multiple comparison, 176, 324, 539 
multiple range, 
Dunean’s, 321, 540 
Newman-Keuls, 540 
Tukey’s, 540 
multivariate, 496, 541 
nonparametric, 183, 539, 657 
of boundedness, 166 
of distribution, 507 
of significance, 
of difference between regressions, 
637 
largest difference, see multiple range 
test 
power of, 176, 501 
sequential, 176, 178, 179, 622 
graphical, 649 
using range, see multiple range test 
Wilcoxon’s, 543 
power of, 182 
robustness of, 183 
Theory, see biometry, hypothesis, law, 
model 
Therapeutic index, 186, 658 
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Three-quarters-life, 127, and see half- 
life 
Tie, 649 
Time response curve, 634, and see dose 
response curve 
Time series, 391 
Toxicology, 186, 188, 501, 658, and see 
bioassay, pharmacology 
Tracer, 189, and see radioactivity 
Transformation, 1, 180, 209, and see 
additivity, analysis of variance, 
bioassay, discriminant function. 
logistic curve, matrix, model 
angular, 5, 372 
are sine, see angular 
inverse, | 
Laplace, 350 
logarithmic, 554 
logit, 1, 372, 501 
log log, 1 
power, 498 
probit, 94, 372, 501 
rankit, 251 
Trend, see regression 
Triangle test, 251 
Truneation, 57 
T test, 181, and see confidence limits, 
Hotelling’s T 
multiple, 324 
non-normal data, 182 
power of, 182 
Tuberculin, 33, and see Mantoux 
Tumor, see cancer 
Variance, see covariance, genetic 
adjusted, 112 
analysis of, 258, 450, 562, 659, and 
see additivity, analysis of co- 
variance, analysis of groups of 
experiments, chi square, compo- 
nents of variance, error, fitting 
constants, interaction, interblock 
error, least squares, model, multi- 
variate analysis, path coefficients, 
regression, structural analysis, 
test, transformation 
model II, 175 
of differences, 164 
random effects, 608 
approximate, 95, 368 
asymptotic, 58, 558 
components, 145, 438, 490, and see 
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components of covariance, cumu- 
lant components, structural 
analysis 
computation of, 607 
genetic, 233 
variance of, 361 
error, 506 
estimated, 273 
genotypic, 232 
homogeneity of, 90 
interblock, see information 
matrix, see covariance matrix 
of difference of adjusted means, 116 
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of difference of means, 452 
of genetic correlation, 359 
of product, 172 
of proportion, 31 
of regression coefficient, 286 
of scale parameter, 503 
proportional to power of mean, 498 
Vital statistics, 167 
Weibull, see distribution 
Weighting, 2, 122, 300, 481, 581 
choice of, 288 
Working value, 2 
X-ray, see radiology 


| 
, 
| 


nn 


jnuuiit 
3 

= 
wu 
jun 


The Biometric Society 


Co 


FounDED BY THE Biometrics SECTION OF THE AMERICAN STATISTICAL ASSOCIATION 


TABLE oF CoNTENTS 


Articles 


Marginal Percentages in Multiway Tables of Quantal Data with Dispropor- 
iON ATePUTEQUCN CLES aces Wer ssey ae ee eet a Were Ie CY F. Yates 


Some Classification Problems with Multivariate Qualitative Data 
William G. Cochran and Carl E. Hopkins 


The Standardisation of Tuberculin Hypersensitivity 
M. Stone and R. A. Bruce 


Optimum Estimation of Gradient Direction in Steepest Ascent Experiments 
Samuel H. Brooks and M. Ray Mickey 


A Stochastic Study of the Life Table and Its Applications III. The Follow-Up 
Study with the Consideration of Competing Risks. ...Chin Long Chiang 


The Spearman Estimator for Serial Dilution Assays 
Eugene A. Johnson and Byron Wm. Brown, Jr. 


The Fitting of a Generalization of the Logistic Curve........... J. A. Nelder 


Combined Analysis of Balanced Incomplete Block Designs with Some Common 
ERREH UM CM bs eee end ors ona ERT Cae ee Ocho M. V. Pavate 


Generalized Asymptotic Regression and Non-Linear Path Analysis 
Malcolm E. Turner, Robert J. Monroe, and Henry L. Lucas, Jr. 


On the Analysis of Split-Plot Experiments.................. H. Leon Harter 


Queries and Notes 


Confidence Intervals for Recombination Experiments with Microorga- 
SEVIS a DSI, Bee ghee Gat Ree Hain GSR GCG CTA A. W. Kimball 


Estimation of Proportion from Zero-truncated Binomial Data 
G. N. Wilkinson 


Book Reviews 


Pea Gumbelastabisties Of HiremMesiauy rons cles a B. F. Kimball 
Vessereau, A.: Méthodes Statisques en Biologie et en Agronomie 
P. Robinson 
Love, A. G., Hamilton, E. L., and Hellman, I. L.: Tabulating Equipment 
ANG VATMUVeVLG AIGA SLAIN ICS es rercrin ite cis cies neck W. W. Holland 
Abstracts 


The Biometric Society 


News and Announcements 


_ 


120 


150 


153 


160 
162 
163 
164 
190 
196 


Number 1 March 1961 Volume 17 


BIOMETRICS 
Editor 
Ralph A. Bradley 


Editorial Borad 
Editorial Associates and Committee Members: C. I. Bliss, Irwin Bross, E. A. 
Cornish, 8. Lee Crump, H. A. David, W. J. Dixon, Mary Elveback, D. J. Finney, 
W. A. Glenn, J. W. Hopkins, O. Kempthorne, Leopold Martin, Horace W. Norton, 
S. C. Pearce, J. G. Skellam, and Georges Teissier. Managing Editor: Ralph A. Bradley. 
Former Editors 
Gertrude M. Cox—Founding Editor 


John W. Hopkins—Past Editor 


Officers of the Biometric Society 
General Officers 


President: Leopold Martin 


Secretary: M. J. R. Healy; Treasurer: M. A. Kastenbaum 
Council: M. 8. Bartlett, W. U. Behrens, C. I. Bliss, A. Buzzati-Traverso, D. G. 
Chapman, W. J. Dixon, C. W. Emmens, D. J. Finney, C. G. Fraga, Jr., Anne Lenger, 
C. C. Li, A. Linder, K. Mather, C. R. Rao, P. V. Sukhatme, G. Teissier, J. W. 
Tukey, G. 8. Watson, F. Yates. 


Regional Officers 


Region President Secretary Treasurer 
Australasian M. Belz W. B. Hall G. W. Rogerson 
Belgian and Congo J. Henry L. Martin A. H. L. Rotti 
Brazilian A. M. Penha A. Conagin A. Groszmann 
British J. A. Fraser Roberts C.D. Kemp P. A. Young 


E.N. American Oscar Kempthorne Erwin L. LeClerg Donald A. Gardiner 
(Henry L. Lucas) 


French Ph. L’ Heritier D. Schwartz D. Schwartz 
German O. Heinisch R. Wette M. P. Geppert 
Italian G. Montalenti R. Scossiroli F. Sella 

W.N. American W. A. Taylor W. A. Becker Bernice Brown 


National Secretaries 


Denmark N. F. Gjeddebek Netherlands H. de Jonge 
India A. R. Roy Norway L. K. Strand 
Japan M. Hatamura Sweden H. A. O. Wold 


Switzerland H. L. LeRoy 


MARGINAL PERCENTAGES IN MULTIWAY TABLES 
OF QUANTAL DATA WITH DISPROPORTIONATE 
FREQUENCIES 


F. YATES 


Rothamsted Experimental Station, 
Harpenden, Herts., England 


SUMMARY 


When multiway tables of quantal data are analysed by transforming 
the percentages and fitting constants to the transformed variate by 
the method of maximum likelihood, direct inverse transformation of 
the values of the fitted constants gives percentages which may deviate 
widely from those observed. The reasons for this are explained and 
methods are given for obtaining correct percentages. 


INTRODUCTION 


Transformations, such as the logit and log log, are now being used 
increasingly for the analysis of multiway tables of quantal data. ‘The 
basic procedures have been described by Jolly [1950], Dyke and Pat- 
terson [1952] and Yates [1955]. Their earlier use for this purpose was 
undoubtedly inhibited by the heavy numerical computation required, 
but the burden of this has been greatly lightened by the introduction 
of electronic computers. The Rothamsted computer, for example, has 
a programme for analyses of this type which will handle tables with 
up to five factors in which the number of cells and marginal means 
does not exceed 256. Constants representing the main classifications 
and any desired sets of sub-classifications can be fitted, and the values 
of the constants and the reduction in variance are determined. 

When an analysis of this kind has been completed, it is frequently 
desirable to present the results in terms of percentages, instead of, 
or as well as, in terms of the transformed variate. If the constants 
obtained.in the analysis are retransformed directly to percentages, 
considerable distortions may occur. It is the purpose of this paper to 
explain why such distortions arise and how they may be removed. 


ESTIMATION OF A MEAN PERCENTAGE BY MEANS 
OF A TRANSFORMATION 


It is well known that, if a set of percentages is transformed by a 
non-linear transformation such as the logit transformation, and the 
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mean of the transformed values is retransformed back to a percentage, 
this percentage will not equal the mean of the original percentages. 
The ordinary maximum likelihood procedure of fitting, using pro- 
visional and working values and successive approximation, removes 
this distortion. 

This result follows immediately from the fact that the weighted 
mean of a set of percentages, with weights proportional to the fre- 
quencies, is the maximum likelihood estimate of the mean percentage. 
A direct proof may, however, be of interest, and illustrates the pro- 
cedure to be followed in the more complicated case of two-way tables, 
discussed in the next section. 

Suppose we have a set of proportions p, = x,/n, , where n, is the 
number of observations in class r and x, the number of those fulfilling 
the required condition, so that 100 p, is the corresponding percentage. 

If the observations in all the classes are in fact samples from the 
same population, the sufficient estimate of the population proportion is 


b= >) th) hen a 


If, with any transformation y = {(p), with inverse p = F(y), a common 
provisional value Y is taken, and Y,,,, and Yi, are the maximum and 
minimum values and w the weighting coefficient for this value of Y 
(as defined in the ordinary maximum likelihood procedure), then the 
working value for class r will be 
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The weighted mean of these values, which will provide a first estimate 
of the adjusted mean, and also a new provisional value, will be 


min 
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If successive approximations are now carried out until there is no change 
in the provisional value, we shall arrive finally at the equations 


Me = Dinas ae Gat: ) 
Val, 


(1) 


where Y, Yinax and Yai, are the final values of Y’ and the corresponding 
maximum and minimum. If P = F(Y), the formulae for Y,,., and 
Vea 
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(See, for example, Fisher and Yates [1957], p. 14.) Hence equation (1) 
reduces to #Q — @P = 0, i.e. P = #, and therefore # = F(g). In other 
words the inverse transformation of the maximum likelihood estimate UT] 
gives an estimate # which is equal to the weighted mean of the original 
percentages, with weights proportional to the frequencies. 


TWO-WAY TABLES 


In the analysis of a multi-way table of quantal data the expected 
values of the transformed variate are assumed to be made up of additive 
components representing the various classifications which require to 
be taken into account. 

Thus in an R X S two-way table the expected value of the trans- 
formed variate for the cell (7, s) is given by 


I a Ns I Ponce alt 


where m is a general mean, a, , --: , a, is the set of constants for the 
first classification, and b, , --- , 0, the set for the second classification. 
The values of m and the a’s and 6’s are obtained by the maximum 
likelihood fitting. 

If the 7,, are transformed back to percentages 100 7,, , then 100 #,, 
will represent the expected percentages on the assumed hypothesis 
without distortion. Examination of the equations which are used to 
determine the fitted constants, on the lines of the last section, shows 
that in the case of the logit transformation the weighted means of 
‘prs » With weights equal to the original frequencies, will reproduce the 
marginal percentages of the original table exactly. This depends on 
the fact that for this transformation 


7) = const, 


dP 
which follows from the general formula for w, namely 


»= 79 (tr): 


With other transformations there will be some discrepancy. There 
is, however, no real inconsistency; the marginal percentages so obtained, 
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not those observed, are the appropriate estimates of these percentages 
for the adopted model, with the observed weights. 

To represent the estimates of the class-differences in terms of per- 
centages in compact form, we require the expected percentages of the 
margins of the table with some form of proportionate weights. As 
in the quantitative case it is usually best to give the percentages that 
would be expected if the frequencies in the cells of the table were 
proportionate to the marginal frequencies, 1.e. 


eg = itt) (ee 


where 
Nr, = Dy Mrs, ebC. 
8s 


These percentages can be computed directly from the percentages 
100 #,, with the appropriate weights, e.g. 


pe = >, Bran’, /N,, ) 
8 
PD ice ne 
8 


We will refer to this procedure as computation from the expected cell 
percentages. : 

Computation of the expected marginal percentages from the expected 
cell percentages, if done by hand, is somewhat tedious, particularly in 
large multiway tables, since the expected percentages of the individual 
cells have first to be computed and their weighted means then taken. 
The question therefore arises whether the values of the fitted constants, 
which are given directly by the computation, can be used to provide 
values of expected marginal percentages. 

In the analogous analysis of quantitative data the values of the 
fitted constants give direct estimates of the values the marginal means 
would assume if the cell frequencies were proportionate to the marginal 
frequencies. With exactly proportionate cell frequencies the values of 
fitted constants m + a, and m + 6, are given directly by the two sets 
of marginal means. In the case of quantal data, however, retrans- 
formation of the values of the fitted constants m + a, and m + b, 
will give percentages which may differ widely from those obtained by 
computation from the expected cell percentages. 

These differences are due to two causes. In the first place, since 
the fitted constants are in effect weighted means of the expected cell 
values of the transformed variates, the corresponding percentages will 
differ from the means, with the same weights, of the cell percentages. 
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In the second place the weights used in the fitting are the products 
of the cell frequencies and the weighting coefficients. The resultant 
fitted constants are therefore estimates of what the marginal means of 
the transformed variate would be if the weights were proportionate to 
the marginal sums of these weights instead of to the marginal fre- 
quencies. If the angular transformation, which has a constant weighting 
coefficient, is used, no differences will arise from this cause, but with 
other transformations it is frequently the major source of discrepancy. 

At first sight it might appear that the best course would be to seek 
a value m’ such that the quantities m’ + a, when transformed give 
percentages whose weighted mean, using the marginal frequencies as 
weights, is equal to the observed overall percentage. This can be done 
without difficulty by successive approximation, and appears to be 
reasonably satisfactory with the angular transformation. When the 
percentages are small and the logit transformation is used, a much 
closer approximation to the marginal percentages computed from the 
expected cell percentages is obtained by the very simple procedure of 
transforming m + a, to percentages and multiplying all these per- 
centages by a common factor, chosen so as to give the observed overall 
percentage, as is illustrated in the example below. This is to be expected 
since at the lower end of the percentage scale the use of the logit trans- 
formation implies that the cell percentages are representable by a 
product function of the form u, », . 

It may be noted here that even with the logit transformation the 
overall percentage computed from the expected cell percentages will 
differ from the observed overall percentage, owing to the change from 
actual to proportionate frequencies in the cells. The difference is not 
likely to be large, but for practical purposes it may be regarded as 
preferable to present marginal percentages whose weighted means are 
exactly equal to the observed overall percentage. or this purpose 
a proportional adjustment of all percentages (or their differences from 
100 when all percentages are large) will be adequate. 


EXAMPLE 


Table 1 gives the percentages of cows affected by milk fever in given 
lactations, classified by lactation number and season of calving, and 
Table 2 the numbers of calvings on which these percentages are based. 
These results were obtained in the course of a survey of diseases of 
dairy cattle. 

It is immediately apparent from the percentages that there is a 
much greater incidence of milk fever in the later calvings, and also 
a variation with season, January-April having the lowest incidence and 
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TABLE 1 
PERCENTAGES OF COWS AFFECTED BY MILK FEVER 


Lactation 
1-2 3 4 5 6+ Overall 
Jan—Apr 0.42 1.45 Sule Dol 6.07 2260 
May-July 0.17 3.08 Cheat 9.43 10.94 AL lg 
Aug—Sept 0.66 4.89 9.40 10.93 13.98 4.25 
Oct-Dec 0.35 2.93 6.07 9.61 12.19 3.50 
Overall 0.41 PTS ROS 8.06 9.38 3.46 


August-September the highest. The August-September maximum is 
somewhat masked in the marginal percentages, however, owing to the 
greater proportions of these calvings in the earlier lactations. 

Constants were fitted using the logit transformation with logs to 
base 2. The results shown in Table 3 were obtained. Inverse trans- 
formation of these logits gives percentages for seasons which are very 
much too high. This is mainly due to the low weighting coefficients 
associated with the low percentages of lactations 1 — 2. 

The expected logits for the individual cells can be computed from 
the values of Table 3, and transformed to percentages. Thus the logit 
for the top left-hand cell is —4.0121 — 2.3786 + 1.9874 = —4.4033, 
and the corresponding expected percentage is 0.2229. The discrep- 
ancies between the weighted marginal means of these percentages, 
with weights equal to the original cell frequencies (Table 2), and the 
observed marginal percentages, are all less than 0.001. 


TABLE 2 
NUMBERS OF CALVINGS 


Lactation 


1-2 3 4 5 6+ Total 

Jan-Apr 3806 2137 1744 1184 2010 10881 
May-July 2352 1006 706 488 841 5393 
Aug-Sept 3169 1003 617 366 522 5677 
Oct-Dec 5117 1740 1252 791 1042 9942 
Total 14444 5886 4319 2829 4415 31893 


MULTIWAY TABLES OF QUANTAL DATA 


TABLE 3 
Frrrep Constants (Loairs ro BAsr 2) AND CorrESPONDING PERCENTAGES 
Lactation m +a, % Season m+. % 
1-2 —4.0121 0.383 Jan-— —2.3786 3.566 
3 —2.5643 2.779 May- —1.9180 6.544 
4 —2.0055 5.840 Aug- —1.6682 9.009 
5 —1.7082 8.564 Oct- —1.9104 6.609 
6+ —1.5678 10.220 — = = 
m —1.9874 5.980 


Table 4 gives the various estimates of the expected marginal per- 
centages with proportionate weights. Line (1) gives the weighted means 
of the expected cell percentages, with weights equal to the marginal 
frequencies of Table 2. These give a small discrepancy in the overall 
percentage, and this is removed in line (2) by proportional adjustments. 
Line (8) is obtained by proportional adjustment of the percentages 


derived directly from the fitted constants (Table 3). 


TABLE 4 
MarGINAL PERCENTAGES COMPUTED BY VARIOUS METHODS 


The 


agreement, 


(1) From expected cell percentages, with proportionate weights 
(2) Line (1) adjusted to observed overall percentage 
(3) From fitted constants (adjusted) 


Lactation 
1-2 3 4 5 6+ Overall 
(1) 0.39 2.79 5.85 8.55 10.18 33 (0) 
(2) 0.37 2.65 5.65 8.11 9.66 3.46 
@) 0.36 2.63 5.54 8.12 9.69 3.46 
Season 
Jan May Aug Oct Overall 
(1) 2.19 3.99 46 4.03 SOO 
(2) 2.08 3.78 18 3.82 3.46 
(3) 2.06 3.78 5), Al 3.82 3.46 
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between lines (2) and (3) is very close. With the logit transformation 
and small percentages direct derivation from the fitted constants may 
therefore be regarded as quite satisfactory. 

Comparison of these expected percentages with the observed per- 
centages of Table 1 shows that the distortions in the observed marginal 
percentages due to disproportionate weights have been removed by 
the fitting. 

The residual sum of squares given by the fitting is 17.57. This 
corresponds to a x” with 12 df., giving .2 < P < .1. The data are 
therefore reasonably represented by additive components in the logit 
scale. 


MULTI-WAY TABLES 


If only constants representing the main classifications are fitted 
the procedure of proportional marginal adjustment is exactly the same 
for three- or more-way tables as for two-way tables. If, however, 
constants representing certain sets of sub-classes (analogous to the 
interactions of a factorial experiment) are also fitted, the situation 
becomes more complex. Nevertheless if the expected values of the 
transformed variate are available only for the margins of the table, a 
procedure analogous to that given above for a two-way table will 
probably be quite satisfactory, but certain minor inconsistencies in the 
margins relating to the main classifications may be expected. 


PROGRAMMING POINTS IN ELECTRONIC COMPUTATION 


Any programme for the analysis of multiway tables of quantal data 
which follows the ordinary maximum likelihood procedure of successive 
approximation must embody provisions for computing the expected 
values, in terms of the transformed variate, of the individual cells of 
the table, since these values constitute the provisional values for the 
next approximation. It must also contain provision for forming the 
weighted marginal means, with disproportionate and probably also 
with proportionate weights. Consequently, in order to calculate the 
expected cell percentages all that is necessary, in the way of additions 
to the programme, is to re-calculate the expected values of the individual 
cells after the last approximation (this will probably be done in any 
case), transform these to percentages, and calculate the marginal means 
of these percentages with weights proportional to the marginal fre- 
quencies of the main classifications. If desired a proportional adjust- 
ment can be made to these percentages so as to give the observed 
overall percentage. 

This procedure is likely to be just as simple, from the programming 
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point of view, as the alternative procedure of adjusting the marginal 
percentages proportionally and should therefore be adopted when writing 
a complete programme. The latter procedure is, however, of interest, 
since it makes it possible to deal expeditiously with results furnished 
by a programme for which no provision for inverse transformation has 
been made, or which only inversely transforms the expected marginal 
values of the transformed variate. 
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1, INTRODUCTION 


Since 1935, when Fisher’s discriminant function appeared in the 
literature, methods for classifying specimens into one of a set of uni- 
verses, given a series of measurements made on each specimen, have 
been extensively developed for the case in which the measurements are 
continuous variates: This paper considers some aspects of the classifi- 
cation problem when the data are qualitative, each measurement taking 
only a finite (and usually small) number of distinct values, which we 
shall call states. Our interest in the problem arose from discussions 
about the possible use of discriminant analysis in medical diagnosis. 
Some diagnostic measurements, particularly those from laboratory tests, 
give results of the form: —, + (2 states); or —, doubtful, + (38 states); 
or (with a liquid), clear, milky, brownish, dark (4 states). 

With qualitative data of this type an optimum rule for classification 
can be obtained as a particular case of the general rule (Rao, [1952], 
Anderson, [1958!). The rule is exceedingly simple to apply (Section 2). 
In practice, qualititative data are frequently ordered, as with —, 
doubtful, +. The classification rule discussed in this paper takes no 
explicit advantage of the ordering, as might be done, for instance, by 
assigning scores to the different states so as to produce quasi-continuous 
data. The best method of handling ordered qualitative data is a subject 
worth future investigation. 


1This work was assisted by a contract between the Office of Naval Research, Navy Department, 
and the Department of Statistics, Harvard University by a Special Research Fellowship of the Na- 
tional Institutes of Health, and by a gift from the Lilly Research Laboratories to the Department 
of Biostatistics, University of Oregon Medical School. Reproduction in whole or in part i is permitted 
for any purpose of the United States Government. 
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This paper seeks answers to three problems that arise in the use of 
the proposed rule. 


(1) The effect of the initial sample sizes on the performance of the proposed 
classification rule. 


In order to construct a rule, we must have preliminary data on some 
specimens known to be classified correctly, since the rule depends on 
the joint frequency distributions of the measurements within each 
universe. Standard classification theory assumes that these distributions 
are known exactly, although in practice it is sometimes difficult to 
obtain adequate samples from which to estimate them. The conse- 
quences of constructing a rule from preliminary samples of finite sizes 
are discussed in Sections 3, 4, and 5. 


(2) The relative discriminating power of qualitative and continuous variates. 


Classification is simpler with qualitative than with continuous 
measurements. For this reason, if a few of the available measurements 
are continuous while the rest are qualitative, we may be inclined to 
transform each continuous measurement into a qualitative one by 
partitioning its frequency distribution, provided that this does not 
result in too much loss of discriminating power. This consideration led 
us to investigate the questions: if a normally distributed variate is 
transformed into a qualitative one with s states, how much discrimi- 
nating power is lost, and what are the best points of partition of the 
curve from the point of view of retaining maximum discriminating 
power (Sections 6 and 7)? 


(3) Use of classification experience for improvement of the rule. 


The optimum rule depends on the relative frequencies 7 , m2 , 73 etc. 
- with which specimens from the different universes present themselves 
for classification. The initial estimates of these frequencies, made from 
previous data or by judgement, may be biased. After a number of 
specimens have been classified by the rule, it-is possible to re-estimate 
the frequencies 7, from the data for these specimens, so that the classi- 
fication rule may be improved (Section 8). 


2. AN OPTIMUM RULE FOR CLASSIFICATION 


Ideally, the setting up of an optimum rule demands three different 
kinds of preliminary information. Firstly, we require the joint frequency 
distribution of the measurements in each universe under consideration. 
With continuous variates the most common assumption about these 
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distributions is that they are multivariate normal with the same dis- 
persion matrix in each universe. With qualitative variates, suppose 
that the jth measurement takes s; distinct states. When this measure- 
ment is made on a specimen, we learn into which state it falls. A 
series of qualitative measurements, e.g. one with 2 states, one with 3 
states and one with 4 states, defines 24 cells or multivariate states. 
For any specimen, these three measurements tell us which of the 24 
states the specimen occupies. Thus, in general, a set of k qualitative 
measurements classifies the specimen into one out of a number S = 
8,8, °++ 8, multivariate states. Consequently the joint frequency dis- 
tribution of the measurements is completely specified for the wth uni- 
verse if we know the probabilities p,; that the specimen falls into the 
ith multivariate state where, for each u, 


S 
Do Pui = il, 


Secondly, practical use of a classification rule is likely to result in 
some mistakes in classification. In some applications certain types of 
mistakes are more serious than others. If the relative costs of different 
kinds of mistakes can be estimated, the rule should take them into 
account. Let c,, be the cost incurred when a specimen which actually 
belongs to universe V is classified as belonging to universe U. 

Thirdly, when the rule is put into use, the specimens to be classi- 
fied may not come with equal frequency from the universes. For 
instance, in a diagnostic screening test, most of the subjects who present 
themselves may be free from the disease in question. Let x, denote 
the relative frequency with which specimens come from the uth universe, 


where 
Ds le 


As might be expected, the optimum rule depends on these frequencies. 

With this background we now construct an optimum rule. For a 
specimen which falls into the 7th multivariate state, we select the 
universe to which it is to be assigned by minimizing the expected cost 
of mistakes in classification. Suppose that the specimen is assigned 
to the uth universe. For specimens that actually come from this 
universe, no mistake is made. Specimens from the vth universe will 
present themselves with relative frequency +, , and will fall into the 
ath state with relative frequency 7,p,; . The expected cost of mis- 
classifying these specimens into the uth universe is therefore 7,p,.C.. 
Hence the total expected cost of mistakes in classification is 
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omer ce 
vu 

To apply the optimum rule we compute this quantity for every u 
and assign specimens to the universe for which the quantity is a mini- 
mum. 

There are several particular cases of the rule. If c,, = Caeehe: 
the cost of misclassifying depends only on the universe from which the 
specimen comes and not on that into which it is misclassified, the total 
expected cost over all m universes is 


™ 


Si TP rily — 55 T yP vil, a TuPuiCu : 


vu v=1 


Since the first term on the right does not depend on u, specimens falling 
into the 7th state are assigned to the universe for which the triple 
product 7.p.:c, is greatest. 

If the relative costs c, are taken as equal, the rule minimizes the 
expected frequency of mistakes in classification. If, further, the relative 
frequencies 7, are also equal, any specimen found in the 7th state is 
assigned to the universe u for which the conditional probability p,, 
is greatest. 

For a numerical example we are indebted to Dr. Leslie Kish. The 
data come from a large study of voting behavior conducted by the 
Survey Research Center, University of Michigan. (Stokes et. al. [1958]). 
By open-end questionnaires taken during the 1952 and 1956 elections, 
voters were rated as Democrats (D), Independents (J) or Republicans 
(R) and also as Unfavorable (U), Neutral (V) or Strongly Favorable (F) 
to Eisenhower’s personality. These are the two predictor measurements, 
each with 3 states, making 9 bivariate states. The voters were also 
asked whether they voted for Stevenson or Eisenhower, these being the 
two universes. The sample sizes were 1003 and 1439. 

For illustrative purposes we have set up a rule classifying the 
subjects as Stevenson or Hisenhower voters by means of the predictor 
variates. The results of the classification are then compared with the 
actual voting behavior. 

Table 1 shows the relative frequencies p,; and p.; of the 7th state 
for Stevenson and Eisenhower voters respectively. We assumed that 
Cc, = c, and that 7, = 7. = }. The resulting classification rule is given 
on the right in Table 1. . Persons falling into the states DU, DN, DF 
and JIU are classified by the rule as Stevenson voters, since in these 
states Dis > Poi - 

The estimated probabilities of misclassification are easily obtained. 
Any Eisenhower voter is misclassified if he falls into the states DU DN, 
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DF or IU. Hence the frequency of misclassification for Eisenhower 
voters is 


.033 + .079 + .090 + .017 = .219. 


The figure for Stevenson voters is 0.133 and the average is 0.176. 

As with continuous measurements, it is possible to examine whether 
a measurement contributes to the classification to a worthwhile extent. 
If political affiliation alone (D, J, R) is recorded, the probabilities in 
the three states are given in the lower part of Table 1, being obtained 
of course by addition from the upper part of the table. The probabilities 
of misclassification are now 0.201 for Stevenson voters and 0.202 for 
Eisenhower voters. Thus the addition of the attitude measurement 
reduces the average number of mistakes by about 13 percent. 

The use of qualitative variates in attempting to date certain of the 


TABLE 1 
ILLUSTRATION OF THE CLASSIFICATION RULE FoR Two MEASUREMENTS 


Resulting 
Pri Pri classification 
State (i) Stevenson Eisenhower rule 
DU 847 .033 Ss 
DN .359 .079 SS) 
DF .094 .090 iS) 
IU .068 .017 8 
IN .078 .107 E 
IF .026 .154 E 
RU) .002 .017 E 
RN .023 .176 E 
RF .004 Sia E 
1.001 1.000 


ProBaBitities Usine PourricaL AFFILIATION ALONE 


State Stevenson Hisenhower Classification 


D .800 .202 


Ss 
I 172 .278 E 
R .029 .520 i 


a ee ee ee ee 
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works of Plato (i.e. to arrange a number of universes in a time sequence) 
has been discussed by Cox and Brandwood [1959]. 

In the rest of this paper discussion will be confined unless otherwise 
mentioned to the case of two universes, with c, = c, = land 7, = m7, = i. 


3. EFFECTS OF THE FINITE SIZES OF THE INITIAL SAMPLES 


The initial sample from each universe serves two purposes. It is 
used to set up the classification rule and to estimate the probabilities 
of misclassification for specimens from the two universes, so that we 
can decide whether the classification is accurate enough to be satis- 
factory. 

Some notation will be needed. To avoid double subscripts, let U, 
U’ denote the two universes, and p; , p/ the true probabilities that a 
specimen will fall into the 7th multivariate state. Independent samples 
of sizes n, n’ are drawn from the universes. The numbers of specimens 
falling into the 7th state are 7; , r; , and the corresponding estimated 
probabilities are #; = r,;/n, p; = r//n’. Since the true p; , p{ are 
unknown, the actual classification rule places a specimen in U if pf; > p/, 
in U’ if pf; < pf. If pj; = p%, the decision is made by tossing a fair coin. 

For given sample sizes n, n’ the values of #; and #/ will vary from 
sample to sample. Thus the actual classification rule and its per- 
formance may change from sample to sample. In describing the 
consequences of finite sample sizes, we usually present the average 
results over all initial samples of given sizes n, n’. 

The principal consequences of the finite sample sizes are as follows. 

1. The probability of misclassification as estimated from the samples 
is, of course, subject to a sampling error that in moderately large samples 
may be shown to be approximately of the binomial type. 

2. On the average, taken over all pairs of samples from a given 
pair of universes, the actual probability of misclassification is greater 
’ than the theoretical optimum: i.e. than the probability that would 
hold if the true p; , p; were used to construct the classification rule. 

3. The average estimated probability of misclassification is less than 
the theoretical optimum probability. From 2, it is also less than the 
average actual probability. 

In other words, finite sample sizes involve two types of penalty. 
Owing to sampling errors in the #; , 6; , the rule obtained may not be 
the theoretical optimum, and the probability of misclassification is 
underestimated from the sample data. 

These results may be illustrated, in exaggerated form, by con- 
sidering samples of size n = 1 from universes having two states with 
the following true probabilities of falling into the states. With the 
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optimum rule the probability of misclassification is 0.1 for specimens 
from U, 0.05 for specimens from U’, the average being 0.075. 


Optimum | 
State U We classification | 
1 0.9 0.05 U 

2 OR 0.95 (gl 


With n = 1, only four types of sample result are possible. Table 2 
shows for each type the classification rule that would be set up, the 
estimated probability of misclassification, and the actual probability 
that would pertain if that rule were used. A ? means that classification 
is a toss-up. | 

To explain the entries in Table 2, consider the last type of result. 

The wrong classification is made in both states. Nevertheless, if we 


TABLE 2 
THE CLASSIFICATION RULE FOR SAMPLES OF SIZE 1 


Frequency Probability of misclassification 
Sample result of occurrence Estimated Actual 
State | U U’ | Rule 
sat 
peal Owe (0.9)(0.95) 
OE AOL Mae Lee =0.855 0.0 0.075 


— 

— 

Be) 
—> 
i= 
Ro) 
VS 
— 


1 0.05) 

2 ? = 0.045 0.5 0.500 
| | 
| 

1 TO SaOMe ce (0.1)(0.95) 

> ae Ws a al Kose = 0.095 0.5 0.500 
| | 
| | 

VA ean kage (0.1)(0.05) 

2 le COS = 0.005 0.0 0.925 


Average 0.07 0.13875 
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believe the samples, we estimate a zero probability of misclassification. 
The actual probabilities are 0.9 for U and 0.95 for U’, giving an average 
of 0.925. 

The overall average actual probability of misclassification is 0.13875 
as compared with a theoretical optimum of 0.075 and an average 
estimated value of 0.07. 

The fact that the average actual probability exceeds the theoretical 
optimum arises, of course, because when p; > p/ , there will be some 
pairs of samples in which #; S {. Whenever this occurs, the actual 
rule for this state is not as good as the optimum. 

The fact that the average estimated probability lies below the 
theoretical optimum probability may be shown as follows. If p; > p{, 
the contribution of this state to the overall theoretical probability of 
misclassification is }p{ . Since r//n’ is an unbiased estimate of p/ , 
the contribution may be written 3H(r//n’), where E denotes a mean 
value. The estimated contribution, on the other hand, is 3 the average 
of the smaller of r,;/n, r//n’, which will always be less than 3 (r//n’). 

The bias in the estimated probability and the inferiority of the 
actual to the optimum rule are due primarily to states in which p; 
differs little from pi . These difficulties can be largely avoided by 
withholding a decision, i.e. making no classification, in such states. 
For instance, suppose that preliminary samples of size 64 give the 
following results for a variate with four states. 


No. of specimens 

State U U’ 
1 51 1 

2 8 5 

3 3 5 

4 2 53 
64 64 


States 1 and 4 provide good discrimination, but in states 2 and 3 
the estimated probabilities of a correct classification are only about 
0.6. Consider a recommended rule to withhold judgement when a 
specimen occurs in state 2 or 3. From the samples, we have an unbiased 
estimate 21/128, or 16 percent, of the proportion of specimens for which 
the rule makes no classification. This estimate is binomial, with stand- 
ard error +~/(0.16)(0.84)/128. When a decision is made, the probability 
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of misclassification is estimated as 3/107, or about 3 percent. ‘This 
estimate will be almost exactly a binomial variate, with negligible bias. 

If, on the contrary, a decision is made in all states, we would like 
to obtain from the initial samples a reasonably unbiased estimate of 
the probability of misclassification for the actual rule developed from 
the samples. An estimate of the difference between the actual and the 
theoretical optimum probability of misclassification is also of interest, 
particularly when it is not too costly to increase the size of the initial 
samples. These topics are discussed in Sections 4 and 5. 


4. ESTIMATION OF THE THEORETICAL 
OPTIMUM PROBABILITY 


As shown in Section 3, the sample probability of misclassification 
is a negatively biased estimate of the theoretical optimum probability. 
For a given value of (p; + p/), it appears intuitively that the under- 
estimation will be greatest when p; = pi. In this case the bias in 
the 7th state is in absolute value, 
2p: — {3H min. (pf; , p1)} 
where 7; , #% are independent binomial estimates of p; . An algebraic 
expression for this quantity as a function of p; , can be developed 
from the binomial distribution, but is unwieldy unless n, n’ are small. 
An approximation to the bias may be obtained by regarding 9; , p/ 
as normally distributed. Since the average value of the smaller of 
two normal deviates is known to be —0.56, the bias in the estimate 
of p; when both samples are of size n is approximately 


—0.56 V 0:9: /N. 


With p; = pt = 0.05, n = 50, for instance, this estimate gives —0.0173 
as against the correct value of —0.0170. 

Since this problem will occur mainly in states with low frequencies 
(unless the classification rule is poor), a relatively crude sample ad- 
justment for bias should be adequate for estimating the overall theo- 
retical optimum probability of misclassification. One suggestion is to 
use (f; + #/)/4 instead of one-half the smaller of #; , 6% when estimating 
the probability of misclassification, this adjustment being made in 
states in which 9; , #/ do not differ significantly at, say, the 20 percent 
level of significance. In the numerical example above, the adjustment 
is made in states 2 and 3. It amounts to estimating the theoretical 
optimum probability as 


14+6.5+4+4+2 135 
128 Were 


= 10.5% 


| 
| 
| 
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as compared with the unadjusted estimate of 11/128 or 8.6 percent. 

Examination of individual cases indicates that this adjustment 
removes most of the bias in states in which p, = pi. When p; ¥ p’ 
the adjustment tends to over-correct, and on the whole is likely to be 
conservative. 


5. THE AVERAGE INCREASE IN THE ACTUAL PROBABILITY 
OF MISCLASSIFICATION 


The average increase in the actual over the theoretical probability 
of misclassification may be expressed as a function of the p; , p and 
the sample sizes n (assumed equal). In the ith state, if p; > p{, the 
optimum rule gives no misclassification for specimens from U, but 
misclassifies the proportion p‘ of specimens from U’ that are in this 
state. The average frequency of misclassification is therefore 3p/ . 
With the samples, there is no increase in this frequency provided that 
r; >r.. Ifr; =ri,wetossacoin. This gives an increase in frequency of 


a(p: + pi) — apt = Fp: — p)- 


If r; < ri, we make the wrong decision, with an increase in frequency 
of mistakes of 


3p: — yp: = 3(p; — pi). 


Hence, over all states for which p; > p{ , the total expected increase 
in frequency of misclassification may be written 


£2, @e— pi Pr. ep <r) 4 Pre G; = 71)}), (5.1) 


with an analogous expression for the states in which p; < p{. 

The probability inside the curly brackets may be obtained from 
tables of the binomial distribution. The normal approximation to 
this quantity is surprisingly good, even for moderate samples and 
small p; , p;. With this approximation, the estimate is the probability 
that a normal deviate is less than 


= Vn | ( S pi | (5.2) 
V DiGi + DG: 


No correction for continuity is applied, because of the presence of the 
term 3Pr. (r; = 1’) in the exact expression. As an example of the 
accuracy, with p; = 0.03, p; = 0.01, n = 100, the normal approxima- 
tion gives 0.1555 as compared with the exact value 0.1567. 

Use of the normal approximation is illustrated in Table 3. From 
the p; , #4 for each state the normal deviate is computed from (5.2) 
and the probability is read from the normal tables. The difference 
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TABLE 3 


Estimation or Acruat Minus Optimum PROBABILITY 
or MIScLASSIFICATION 


No. of specimens 


State U We N.D. Probability |p: — p's | 
1 51 1 Large 0.0 — 
2 8 5 —0.880 0.189 0.0469 
3 3 5 —0.732 0.232 0.0312 
4 2 53 Large 0.0 = 
Total 64 64 


between the average actual and the theoretical probability of mis- 
classification is estimated as 


4{(0.189)(0.0469) + (0.232)(0.0312)} = 0.0080 = 0.8%. 


At the end of Section 4 the theoretical probability of misclassification 
was estimated as 10.5 percent. Adding 0.8 percent, we estimate the 
expected actual probability as 11.3 percent. 

In order to obtain a general impression of the effect of sample size, 
some study was made of expression (5.1) for the average increase of 
the actual over the theoretical optimum probability, as a function of 
Pp: , p;,n, and S, the number of states. The difficulty in studying this 
function lies in the substantial number of parameters involved when 
the states are numerous. A preliminary search was made for a smaller 
number of parameters which might dominate the performance of the 
function. It became evident that the function depends on the overall 
optimum probabilities of misclassification, i.e. on the quantities 

(Ae os, Pi; Pe Ss UA 
Di<p'i pli<pei 
As P, P’ diminish, i.e. as the optimum rule becomes better, the average 
increase also diminishes, i.e. the actual rule is less likely to differ from 
the theoretical optimum. Even with the quantities P, P’, n and S fixed, 
the average increase may vary from almost zero up to a maximum, 
depending on the way in which p; ,.p{ distribute themselves among the 
states. We decided to tabulate the maximum increase for given P, 
P’,n and S, although aware that this might produce an unduly pessi- 
mistic view of the effect of finite sample size. The worst possible 
distributions of the p; , p{ were found by trial and error. In most 
situations, though not in all, the worst case occurs when the probability 
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P’ is distributed equally among all but one of the states for which 
Di 2 p;. Asan example, for 8 states, with P = P’ = 0.1, the worst 
distribution in the first four states is as follows. 


Pi Di 
.69 0 
07 .0333 
07 .0333 
07 .0333 
Table 4 shows the maximum average increases for n = 50, 100; 


S = 4, 8, 16, 32; P = P’ = 0.05. 0.1, 0.2, 0.3, 0.4. 


TABLE 4 
Maximum AVERAGE INCREASE IN PROBABILITY OF MISCLASSIFICATION 
i = 1X0) 7 = 100 
Opt. Prob. Number of states Number of states 
ai 4 8 16 32 4 8 16 82 
0.05 .009 .018 .032 .059 .006 Oia! .020 .034 
0.1 LI .022 .039 .067 .008 .015 1025 .041 
OeA4 .014 027 .049 .082 .010 .020 .033 .052 
0.3 .016 .034 057 .098 O11 .023 .038 .062 
0.4 .025 ORY .061 O71 .012 .026 .041 061 


As mentioned previously, the average increase is smallest when 
P, P’ are small. It becomes larger as the number of states increases, 
being somewhat less than doubled for each doubling of the number of 
states. The average increases for n = 100 are roughly 1/ 4/2 times 
those for n = 50. Some irregularities in the table will be noticed for 
P = 0.4. The explanation is that the actual rule always gives at 
least 50 percent correct classifications. This imposes an upper limit 
to the maximum average increase for P = 0.4. 

It is difficult to give general recommendations from this analysis. 
The following comments may be appropriate. 

1. Preliminary samples of around 50 specimens from each universe 
should give a good indication of those multivariate states in which 
the classification will be accurate, and of those in which it will be 
dubious. 
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2. If a classification is to be made for all multivariate states, the 
estimated probabilities of misclassification should be increased by the 
two adjustments for bias. 

3. If the number of multivariate states does not Rae 8, Table 4 
suggests that samples of size 50 should make the actual mae satis- 
factorily close to the optimum in most situations. With 8 states and 
n = 50, the actual probabilities in the worst cases are 6.8, 12.2, Pipe 

‘and 33.4 percent, for theoretical optimum probabilities of 5, 10, 20 and 
30 percent, respectively. For larger numbers of states, substantially 
greater samples are indicated. 


6. THE EFFECT OF REPLACING A NORMAL VARIATE 
BY A QUALITATIVE VARIATE 


When some variates are continuous and some qualitative, Rao’s 
general rule for classification still applies. However, it presents diffi- 
culties in application, as Linhart [1959] has pointed out, since a dif- 
ferent continuous discriminant is required for each multivariate state 
defined by the qualitative variates. 

In this section the loss of discriminating power when normally 
distributed variates are partitioned into qualitative variates is examined. 
It is assumed that the variates are independent. This assumption is 
unrealistic for applications, but we have not been able to reach general 
results with correlated variates. With independent variates, simple 
asymptotic results can be obtained when the number of variates becomes 
large, and can be checked against the exact results for a small number 
of variates. 

Let the vth continuous variate be denoted by y, , v = 1, --- , ki), 
and the two universes by A and B. The variate y, is normal, with s.d. 
unity in each universe, and with mean 0 in A and 6, in B, the quantity 
6, being ‘‘the distance apart” of the two populations. The larger 6, 
is, the better y, is for classification. 

Given k independent variates, it is easy to show that the best 
continuous discriminator, when divided so that its standard deviation 
is 1, is the quantity 


SED DAHA SOD (6.1) 
For this discriminator the distance between the two populations is 
eS ‘S 6,- As k becomes large with all 6, > 0, the distance becomes 
so great that classification is almost perfect. In order to discuss the 
situation in which some mistakes in classification are made, we assume 


that as k becomes large the 6, tend to zero in such a way that VW ye é. 
remains finite. 
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The continuous classification rule assigns a specimen to B if R, > 
3 Vv > 6, - Since R, is normally distributed, the probability of mis- 
classification, which is the same for both universes, is the probability 
that a normal deviate is greater than iV> ® : 

Let each continuous variate be partitioned into a qualitative variate 
with six states. Results for fewer than six states are derived as particular 
cases of the result for six states. For the vth variate, let Pr. (j, | A), 
Pr. (j, | B) be the probabilities that a specimen from A and B respec- 
tively falls into the j,th state. The 6* multivariate states into which 
the specimen can fall may be specified by recording the states j, , 
Je , -*+ , jx for the individual variates. Since the variates are inde- 
pendent, the probabilities that a specimen falls into a given multivariate 
state are 


I] Pr. (j, | A) and II PreeGs iB). 


v=1 


Consequently the optimum discrete rule places a specimen in A if 


k k 
I] Pr. @| A) > I Pr. G. 1B), 


1.6. 1f 
k 


> {log Pr. (j, | A) — log Pr. (j, | B)} > 0. (6.2) 


This may be written 
Dia 0 (6.2)’ 
where 
x, = log Pr. (j, | A) — log Pr. (j, | B). 


The quantity >> «x, is a discrete random variable with 6° possible 
values, since its value depends on the multivariate state into which 
the specimen falls. It will be shown that as k becomes large, this 
quantity tends to be normally distributed. 

The way in which an individual variate is partitioned is shown in 
Figure 1. The six states are labeled, for mnemonic reasons, A, a, a, 
6, b, B: they might be called “strongly, moderately, slightly favorable 
to A”, etc. The partition involves two disposable numbers u, , Us , 
which determine the sizes of the regions a, 6 and a, b. (Four numbers 
might be used, but symmetry suggests that two will give the best 
partition.) Strictly, the numbers should be denoted by 4, , Us» ; 
since it may be profitable to vary them with »v, but for simplicity the 
subscript v will be omitted from u, , v2 and 6 at present. 
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Universe A Universe B 


FIGURE 1 
Partition or A NoRMAL VARIATE InTOo Srx StaTEs 


For an individual variate v Table 5 shows the notation used for the 
probabilities of falling into the respective states. 

The probabilities p; are integrals over the parts of the normal curve 
indicated in Figure 1. Note that the Pr. (| B) values are the same as 
the Pr. (| A) values in reverse order. 


TABLE 5 
PROBABILITIES THAT A VARIATE WILL FALL INTO THE 6 STATES 


State 
A a a B b B 
Pr. (| A) pr p2 Ds pa ps Pe 
1Psa, (Ci 135) Ds Po ps Ds p2 pi 
x W W2 W3 —W3 —W2 = Wy 


The last line of Table 5 shows the six possible values taken by 
the random variable 


z = log Pr. (| A) — log Pr. (| B) 
where 
w, = log (pi/pe), We = log (p2/ps), ws = log (ps/p,). 
It follows from Table 5 that 
E(z | A) = wi(p, — po) + wa(p2 — ps) + ws(ps — ps) 


while E(x | B) has the same value with sign reversed. The variance 
has the same value for both universes, namely 


V(x) = wilrr + pe) + welp2 + ps) + ws(ps + ps) — [E(a | A)P. 
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Reverting to inequality (6.2)’, its left side is the sum of a large number 
of independent variables x, and hence tends to become normally dis- 
tributed as k becomes large. 

It remains to obtain the values of E(x | A) and V(x) when 6 tends 
to zero. The leading terms in the expressions for the p; and the w; 
are found to be as follows. 

Let 


i= / Hd, “I, = / 2(1) dt 


where 2(f) is the normal curve, and let 2, , 21 , 22 be the ordinates of 
this curve at 0, 7, , uw. , respectively. Then, from Figure il 


a [ 2(t) dt = I, + 462, , 


\|- 


m= | Adi=t—L+ he —2). 


<2 


Similarly, 


viH 


cee Rs cae) (eitieee 20) pea — Ty = 3665 = 21); 
ps = 1, — dz — 56%, — 2), De = Ig = 3622.5 


—— 


520 - 5@ — %) ap, a Deo Bn) 


= , = 
‘ 1 
ee 


ee ear ieee 


Substituting these values in (x | A) and V(x), retaining only the terms 
of order 6°, we find 


i 


E(a | A) = oe ar “ = ‘is ie a }. (6.3) 
1 2 2 1 
Via) = 2E(a | A) = 2 oft , w), (6.4) 


writing f(w, , us) for the expression in brackets in (6.3). 

We now reintroduce the subscript v in order to sum over the k 
variates. From inequality (6.2)’ the probability of misclassifying a 
specimen from A is the probability that the random variable De CAE® 
isnegative. From (6.3) and (6.4) this variable is approximately normally 
distributed with 


ke Ik 
Mean = Soh Dine pela ii. Veniance= 2 )-d,f (lis 5 ts.) 
v=1 v=1 


Hence the probability of misclassifying an A is approximately the 
probability that a normal deviate exceeds 


WV noi ahig as) /2: (6.5) 
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In order to minimize the probability we must maximize (6.5). Since 
f(t» , U2) does not depend on 6, , expression (6.5) shows that the 
maximum is attained by substituting the same values of w,, , Ue, M 
every term, i.e. the values which maximize f(u, , U2). This reduces 
(6.5) to 


~/max. fu; , ue) VW >> 6/2. (6.6) 


With the untransformed normal variate it was shown earlier that 
the best rule gives a probability of misclassification equal to the proba- 
bility that a normal deviate exceeds $V >) 6; . Comparing this 
result with (6.6), the relative discriminating power of the best quali- 
tative to the best continuous rule is 

2 2 2 

2 MAX af bin a) =e Max ‘2 kL 4a + j= ay (6.7) 

This result holds in the sense that if a randomly chosen fraction 
2 max. f(u, , U2) of the original normal variates is retained, discarding 
the rest, the best continuous rule becomes equivalent to the best quali- 
tative rule based on all the variates. 

The result for five states is obtained by ignoring the distinction 
between the states a and 8, calling the combined state D (doubtful). 
The effect is that the last term on the right of (6.7) disappears. For 
four states we let uw, = 0, the states becoming A, a, b, B. For three 
states, the states a and b are combined into a D region. Finally, for 
two states we put uw. = 0. 

The form of the expression 2f(u, , uz), the values of u, , we giving 
the best partition of the normal curve, and the relative discriminating 
power of the qualitative variates appear in Table 7 for 2, 3, 4, 5 and 
6 states. 

Since the maxima of f(u; , we) are flat, the best values of u, and uz 
are given only to one decimal. For two states, the relative power is 
2/x, which will be familiar as the asymptotic power of the sign test. 
Use of five or six states retains over 90 percent of the power. 

The function 2f(u; , u2) has already appeared in two other problems 
involving the replacement of normal curve methods by less efficient 
methods. Ogawa [1951] obtained this function as the efficiency of 
estimation of the population mean from suitably chosen order statistics 
in large samples. D. R. Cox [1957] found the same expression for the 
relative amount of information retained by grouping the normal curve 
when the group boundaries are chosen to minimize the quantity 
E{x — y(x)}?/o’, where u(x) is the mean of the group to which z is 
assigned. So far as we can see, the three problems are mathematically 
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TABLE 7 


Tue Asymprotic ReLative DiscRIMINATING POWER OF QUALITATIVE 
To Continuous NorMAL VARIATES 


é Best values 
No. of of Relative 
states 2f(u , Us) Ww Ue power 
2 Dh =a 0.636 
3 23/Te = 0.6 0.810 
[2 (20 — 22)? 
4 25— + ————_ — 1.0 0.882 
ln ae & 
23 Ca = ae 
5 ie ek! Seats 0.4 1.2 0.920 
Ts I, — Ip 
a (4 — am)? (a — a)?| 
6 2 — — (Oy ae GS 0.942 
I, I, — I, 5 — Gi J 


different, and the identity of results, as Cox has pointed out, seems to 
depend on particular properties of the normal curve. 


7. COMPARISONS FOR SMALL NUMBERS OF VARIATES 


As a check on the utility of the asymptotic results for practical 
situations, the best points of partition and the resulting probabilities 
of misclassification were computed directly for small numbers of variates. 
For simplicity the distance apart 6 was assumed to be the same for 
all variates. 

Having calculated the probability of misclassification for a given 
6 and number of variates k, we can compute the number k’ of continuous 
normal variates that are needed to give the same probability of mis- 
classification. For these values of 6 and k, the ratio k’/k gives the 
relative discriminating power of the qualitative to the continuous 
normal variates. Unlike the asymptotic case, this ratio is of course 
dependent on the values of 6 and k. For fixed k, however, the ratio 
turns out to be almost constant to two decimal places over the range 
of probabilities of misclassification that are of practical interest. 

In the top half of Table 8, the relative discriminatory power with 
two variates is compared with the asymptotic power from Table 7 
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TABLE 8 
RELATIVE DiscRIMINATING POWER OF QUALITATIVE 
to Continuous NORMAL VARIATES 
FOR SMALL NUMBERS OF VARIATES 


2 states 3 states 4 states 5 states 
le SO ip SS k=2 k= 0 PP Ii = eS ie 2 Kec 
.50 64 we! .81 . 84 .88 .89 .92 
2 states 3 states 
k k 
2, 3 4 5 6 7 © 2 3 4 5 co 
50 74 56 70 58 ~—s«.68 64 74 76 77 78 81 


for 2, 3, 4 and 5 states. The “small-sample” powers are all lower 
than the asymptotic values. 

The lower half of Table 8 shows the results with two states and 
from 2 to 7 variates. The results have one curious feature. With an 
odd number of variates the relative powers are higher than the asymp- 
totic values. The explanation may be that none of the decisions re- 
quires the toss of a coin when the number of variates is odd. In line 
with this result is the fact that with two states and only one variate 
(k = 1) the classification rule is the same as that obtained from the 
original normal variate, so that the relative power is unity. With 3 
states (Table 8) this peculiarity does not appear, the relative powers 
increasing steadily with k towards the asymptotic value. 

The optimum values of u, and uw, for small numbers of variates 
were consistently close to the asymptotic optima. This result held 
throughout a wide range of values of 6 ~/k from 0.5 (corresponding 
to a probability of misclassification of over 40 percent) to 4. Since, 
moreover, the optima are flat, the asymptotic values of u, and up 
may be used safely with only two variates. 

A few comparisons have been made between the qualitative and 
continuous discriminators for the case of two correlated variates of 
equal discriminating power. Suppose that the continuous scales are 
arranged so that the variates have standard deviations unity, with 
means Zero in population A and 6 in population B. If they are positively 
correlated, as appears to be the more common situation in applications, 
results obtained for 2 and 3 states indicate that the qualitative dis- 
criminator has higher relative power than when the variates are in- 
dependent. ‘The reverse seems to hold with negative correlation. 
Consequently our results for independent variates cannot be assumed 
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to be valid for correlated variates. More investigation is needed. 

In order to transform a continuous into a qualitative variate in 
an application we compute the means J, , 7, of the continuous variate 
in the initial samples from the two universes and the pooled within- 
universes standard deviation s. Let 7 = (G4 + 9Gz)/2. With, for 
instance, five states the best asymptotic values of u; » Ug are u, = 0.4, 
Uz = 1.2. The states are constructed as follows. 


Value of y State 


less than 7 — 1.2s 

from 7 — 1.2sto 7 — 0.48 
from 7 — 0.4s to 7 + 0.4s 
from 7 + 0.4stog + 1.28 
greater than 7 + 1.2s 


eo CaS iS 


8. ESTIMATION OF THE RELATIVE FREQUENCY WITH WHICH 
SPECIMENS COME FROM THE TWO UNIVERSES 


If specimens present themselves with unequal frequencies 7, 7’ 
from the two universes, the optimum rule is to assign a specimen to 
U if rp; > xp. This rule requires an initial estimate or guess about 
the value of 7. With qualitative variates the accuracy of this figure 
will often be not critical, because the same optimum rule holds over 
a range of values of 7, although the range becomes shorter as the number 
of states increases. Table 9 gives an illustration for 4 states. The 
horizontal line marks the boundary of the best decision region. 


TABLE 9 
Tue OptimuM RULE FOR DIFFERENT VALUES OF 7 


Range of x Range of x Range of + 

O11 — .4 .4— .64 HL = OY) 
State Di pt ps Ps Pi pt 
i ao Zee OL .92 .01 .92 .01 
2 .06 .04 .06 .04 .06 .04 
3 04 .07 .04 .07 04 .07 
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If we guessed r = 1’ = 0.5, we would use the middle classification 
rule in Table 9. This remains the best rule so long as 7 lies between 
0.4 and 0.64. Even with z outside these limits, this rule may be close 
to the best. For instance, if + is 0.7, the average frequency of mis- 
classification using our rule is (.7)(.04) + (.3)(.05), or .043, as against 
(.3)(.12), or .036 with the optimum rule. With + = 0.8 the corre- 
sponding figures are .042 and .024, a more serious difference. 

When m specimens have been classified, the data give the numbers 
of specimens r; found in each state, where >> r; = m. If the p, , pi 
are known, the 7; follow a multinomial distribution with probability 


P; = TD; ae 1p 
in the zth state. The log likelihood is 


S aL rp; — p') 
— et 8.1 
= Din; log pe tn) ae PE ae (8.1) 
aL ri(ps — pi)” : nd (p: — pi) 
ee - Se ; pi)” “ LODE ta reset 
so that the estimated variance of # is 
be , 


The maximum likelihood estimate # obtained ie setting 0L/dr = 0 
in (8.1) can be found by trial, although this is tedious if the states 
are numerous. One way of obtaining a first trial value of z is to note 
that each state furnishes an unbiased estimate. Since 


E(;) = mP; = map; + x’p'), 
the estimate #; from the 7th state is 
= [(r,/m) SE pil/ (p: — pi): V@) = P.Q./m(p; = pi). 


These estimates are likely to differ markedly in precision. A fairly 
good method for a first trial # is to combine the states for which p; > p/, 
making a single estimate from these states. Table 10 shows a numerical 
example for a sample of 100 specimens. 

From the first two states a trial value is computed as 


0.67 — 0.2 
0.6 


The values of r;(p; — pi), P; and r;(p; — p!)/P, are next computed 
(Table 10). The total of the latter values gives 0L/dm as —5.2. Since 


A 
Cie SS 


= 0.78. 
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TABLE 10 

EXAMPLE OF THE M. L. EstImMATION OF 7 (CompUTED FOR # = .78) 

pi Di ri ri(pi — pt) Py r(py— py)/P: 
.o ek 3¥/ 14.8 .412 35.9 
3 gal: 30 6.0 . 256 23.4 
ay 2 9 —0.9 .122 —7.4 
al Ae 24 a) .210 Sil, I 
100 =5.2 


0L/dr is a decreasing function of 7 the second trial must be lower. 
The estimate #, = 0.75 gave +0.5 for the first derivative. From (8.2) 
the estimated standard error of #, was found to be +0.075. 

Examination of the formula for the standard error shows that the 
estimate # cannot be expected to be precise: the standard error is 
always larger than the value Wzx(1 — 7)/m that would apply to a 
binomial estimate of 7. This suggests that it will be worthwhile to 
estimate a only after a substantial number of specimens have been 
classified. 


9. SUMMARY 


This paper deals with the problem of assigning specimens to one 
of two or more universes when the measurements on each specimen 
are qualitative, each taking a small number of states. After presenting 
the optimum rule for classifying the specimens, three problems are 
considered. 

The construction of the rule requires initial data on a number of 
specimens known to be classified correctly. Standard classification 
theory assumes that these initial samples are infinite in size, although 
in practice they may be only moderate. The principal effects of the 
finite sizes of the initial samples are that the probability of misclassifi- 
cation of the rule derived from them is underestimated and that this 
rule may be inferior to the theoretical optimum rule that we could 
construct if we had infinite samples. Methods are proposed for ob- 
taining reasonably unbiased estimates of the performance of rules 
derived from finite samples and for estimating the difference between 
the actual and the theoretical optimum probability of misclassification. 
It appears that initial samples of size 50 from each of two universes 
should be adequate if there are not more than 8 multivariate states. 
With greater numbers of states, larger sample sizes are needed to ensure 
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that the actual rule will be almost as good as the theoretical optimum. 

If most of the variates are qualitative but a few are continuous, 
one possibility is to transform the continuous variates into qualitative 
ones, particularly since classification is easier with qualitative than 
with continuous variates. Asymptotic results are obtained for the best 
points of partition and the probabilities of misclassification when a 
large number of independent normal variates are partitioned to form 
qualitative variates. For qualitative variates with 2, 3, 4, 5 and 6 
states the relative efficiencies are 64, 81, 88, 92 and 94 percent re- 
spectively. Computations for small numbers of variates show that 
the asymptotic points of partition remain satisfactory although the 
relative efficiencies are in general lower. 

The optimum rule depends on the relative frequencies with which 
specimens to be classified present themselves from different universes. 
Initial estimates of these frequencies must be made in order to set 
up the rule. With two universes, maximum likelihood estimates of 
the frequencies from the data for specimens that have been classified 
by the rule are given. These estimates enable the rule to be improved 
if the initial estimates differ from the frequencies that apply when the 
rule is being used. 
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INTRODUCTION AND SUMMARY 


Tuberculin hypersensitivity results from stimulation of the reticulo- 
endothelial system by tubercle bacilli and is due to the production of 
sensitised lymphocytes, that is, lymphocytes carrying antibody specific 
towards tuberculin. The presence of this antibody is detected by the 
Mantoux Test in which fluid containing tuberculin is injected into the 
superficial layers of the skin. This intradermal injection forms a bleb 
which quickly spreads outwards and is absorbed, depositing the tuber- 
culin in the superficial layers. After deposition the tuberculin either 
escapes into the circulatory system or is reacted upon in situ by the 
antibody resulting in a final distribution of ‘‘reaction products” about 
the point of injection. It is postulated that induration or hardening 
of the skin will occur wherever the areal density of these products 
exceeds a certain level and that this level actually obtains at the edge 
of induration. In other words the areal density of tuberculin decreases 
from the point of injection and induration occurs wherever it exceeds 
a certain minimum level, the Minimum Tuberculin Concentration 
(c.f. Miles [1949]). 

In this paper a mathematical model is presented by which the 
antibody-level may be calculated from a knowledge of the dose and 
concentration of tuberculin used and the resultant area of induration, 
provided this is non-zero. A statistical model of the multi-hit type 
deals with the antibody-level induced by different amounts of bacillary 
stimulation and thereby provides a measure of the virulence of the 
particular tubercle bacillus. The models can be applied to the results 
of skin-tests for any antibody by means of an appropriate antigen 
but for convenience they will be explained in the context of data from 
the Mantoux Test. 

R. A. Fisher [1949] made a statistical analysis of the results of 
intradermal injections in cows in order to standardise different tuber- 
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TABLE 1 


Errect or TUBERCULIN DosE 1n Constant VOLUME 
ON DIAMETER OF INDURATION 


Diameter mm 


No. of Ne Areal 
subjects OMEAWe aol Difference 
Group I 
2 11.0 2.0 117 
8 12.0 4.0 128 
11 12.5 5.0 131 
il 12.5 5.5 126 
10 12.5 6.0 120 
5 1320 6.5 127 
8 13.0 G20 120 
2 13.0 Go 113 
4 13.5 Ce 133 
il iL 5) 7.9 126 
11 13.5 8.0 118 
2 14.0 7.0 147 
2 14.0 8.0 132 
1 14.0 8.5 124 
3 14.0 9.0 115 
1 14.5 8.5 138 
15 14.5 9.0 129 
1 14.5 10.0 110 
4 15.0 9.5 135 
17 15.0 10.0 125 
3 15.5 10.0 140 
1 lies 10.5 130 
2 altsyet9) LEO) 119 
2 16.0 10.5 146 
2 16.0 11.0 135 
2 16.0 12.0 112 
1 16.5 Wt) 140 
12 16.5 12.0 128 
1 Ile 0) 12.0 145 
il 18.0 145,13) 142 
10 18.0 14.0 128 
2 18.5 15.0 117 
2 19.5 16.0 124 
4 20.0 16.0 144 
2 20 .0° 16.5 128 
1 20.5 16.5 148 
1 22.0 19.0 123 
2 23.0 20.0 129 
1 27.0 25.0 104 
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TABLE 1—(Continued) 


Diameter mm 
No. of 


ie Areal 


subjects 1ORRACE I IROL Difference 
Group II 
10 10) 4.5 124 
11 12.5 6.0 120 
5 14.5 9.0 129 
5 15.0 10.0 125 
= 20.0 16.0 144 
6 20.0 16.5 128 


culins. He concluded “the precision of such (tuberculin) readings 
regarded as biological assay seems to have been much underrated.” 
Our work shows that a similar precision can be revealed by statistical 
and mathematical analysis for observations on human subjects and 
guinea-pigs. 


EXPERIMENTAL MATERIALS AND METHODS 


Tests were made on 260 human subjects and 10 guinea-pigs. The 
tuberculin was PPD (human) from the Ministry of Agriculture and 
Fisheries Institute, Weybridge. The B. C. G. was the liquid form from 
the Danish Serum Institute and the virulent tubercle bacillus was 
the strain Myc. Tub. H37Rv from the National Institute of Type 
Cultures. The injection-technique was carefully designed to reduce 
the error variance of the volumes injected. The diameters of induration 
were measured to 0.5 mm. at 72 hours after injection. 


RESULTS AND THEORY 


1. The relation between induration-area and dose of tuberculin in constant 
volume 


About 250 subjects were given intradermal injections of 1 and 10 
tuberculin units (T. U.’s) in 0.1 ml. Table 1 gives the results of those 
161 subjects (Group I) who gave readable non-zero reactions to both 
doses. Another 40 subjects (Group II), known to be negative reactors, 
were BCG-vaccinated and then tested with 10 and 100 T. U.’s in 
0.1ml. Three levels of BCG-dose were used which explains the bunching 
of observations in this group. 

The square of the diameter of induration D will be called the in- 
duration-area, written as D{} , Dj, , Dioo for the three tuberculin doses. 


36 BIOMETRICS, MARCH 1961 


600 700 


D 
500 


300 400 


« GROUP I 


200 


+ GROUP II 


100 


fo) 100 200 300 400 500 600 700 
2 
Dy Dit MM 
FIGURE 1 


Tuer AREAS OF THE PAIRED TUBERCULIN REACTIONS OF Groups J AND II 


The areas are plotted in Figure 1. The areal differences A (either 
Di, — D? or Diy, — D%,) are given in the final column of Table 1. There 
is no discernible regularity in the A’s of either group and.95 percent 
confidence limits for the difference of their means are — 1.4, 3.8. Pooled, 
their standard deviation of 7.5 is consistent with constancy and an 
observational error of standard deviation 0.21mm. Fora representative 
areal difference is derived from reactions of sizes D’ and D with means 
15 and 10 mms. respectively. Then 


6A = 30 6D’ — 20 6D 
s.d. A & (900 var 6D’ + 400 var 6D)'” = 7.5. 


These results suggest that the relation between the induration-area 
and the dose d of tuberculin in T. U.’s is 


D® = a* logd + B (1) 


where a* is a constant for all 201 subjects and 8 is a personal constant 
containing all the information in the reaction relevant to the subject’s 
sensitivity. The least-squares estimate of a* is 126.7 (s. d. 0.5). For 
a given subject the critical dose of tuberculin which will just produce 
induration is obtained by putting D = 0 in equation (1). 

Additional confirmation of the equation was provided by testing 
four subjects with a range of doses in 0.1 ml. The results are given 
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TABLE 2 
THE OBSERVED AND PREDICTED INDURATION AREAS AT CONSTANT VOLUME 


Dose of PPD in T.U. (d) 


Subjects 10 5 2.5 1 Ons 0.25 Ora 
1 256 225 196 — 100 — Eoae 
(218) (180) (91) (3) 
2 144 110 72 20 0) 0 
(106) (68) (17) (0) (0) (0) 
3 144 110 72 16 (0) 0 0 
(106) (68) (17) (0) (0) (0) 
4 110 — 36 — _ 0 0 
(34) (0) (0) 


in Table 2 where the bracketed figures are the areas predicted from 
equation (1) with a* = 126.7, using the observed areas at 10 T. U. 
to estimate ~. 


2. Effect of change of volume of injected fluid on (i) induration-area (12) 
area covered by fluid. 


16 subjects were each given four simultaneous intradermal tuber- 
culin injections, 10 and 1 T. U. in 0.1 and 0.2 ml., and two intradermal 
injections of Evans blue in volumes of 0.1 and 0.2 ml. The results 
are given in Table 3 where F is the diameter to which the dyed fluid 
spread after two hours. For the tuberculin results, at each volume the 
areal difference A from 1 to 10 T. U. is calculated for the cases when 
both areas are non-zero. The average of the A’s at 0.1 ml. is 133.2 
which is not significantly different from 126.7. Half the average at 
0.2 ml. is 121.8 which is only 4.9 less than 126.7. The suggestion is 
that the rate of change of area with log-dose is proportional to the 
volume of fluid V. (In fact, for all but one subject the A at 0.2 ml. 
is less than twice the A at 0.1 ml., so that the relation is probably not 
exact.) Equation (1) can be extended to give 

D’ = aV logd + B(YV). (2) 
Comparison with equation (1) shows that 6(0.1) = 6 and 0.1 & = a*, 
so that a can be estimated as 1267. For the dye-injections, the results 
suggest that F’/V is independent of V or that 


F? = «V. (3) 
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Model for the deposition of tuberculin 


By requiring that d exp (— D’/&V log, e) be constant for constant 
V, equation (2) suggests that the fluid spreads and is absorbed as Op 
before absorption, it assumed the shape of a circular-normal distri- 
bution truncated at diameter F and with a variance s° proportional to V. 

For such a distribution the areal concentration of fluid f(r) at 
radius r would be 


f(r) = [V exp (—4r’/s’)]/2zxs°[1 — exp (—F’*/8s°)] 
for r < 3F. If 6V is the extra fluid conceptually needed to extend the 
circular-normal distribution to infinity then 
V/V + 6V) = 1 — exp (—F’/8s’) = 
say. If 8s° = aV where a is a constant, then use of equation (3) gives 


uw = 1 — exp (—x/a) (4) 


so that uw is also a constant. If ¢(r) is the areal density of deposited 
tuberculin at radius r in T. U.’s/mm’, then 


t(r) = 4d exp (—4r’/aV)/nrapV. (5) 


Denoting the Minimum Tuberculin Concentration by 7, this will obtain 
at the edge of induration provided D < F; so ce ete pe |) 
and ¢(3D) = 7 in equation (5), we obtain 


D’ = aV log, (4 d/rayrV). (6) 


When D = F, the Minimum Tuberculin Concentration may be exceeded 
at the edge of induration and equation (6) will not be true. Reference 
to Table 3 shows that usually D < F. 


Agreement with the data 


Equation (2) agrees with equation (6) when & = a log, 10 and 
B(V) = &V log (4 log,10/maurV). By equation (4) we find that 
the four levels of « represented in Table 3 have corresponding levels 
of 1 — » of 10°**, 10°”, 10°**? and 10°*°. So for these subjects 
yw & 1 and the relation becomes 


D? = aV log (4d log, 10/rarV) (7) 


to a good approximation. (It is, however, conceivable that, if more 
subjects had been examined, smaller values of » would have been 
found.) Equation (7) predicts the relationship between D and V when 
d is held constant. Table 4 gives the results of calculations of log 7 
for the subjects of Table 3. For each subject, the model is supported 
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by (i) the constancy of the log 7’s calculated from non-zero diameters, 
(ii) the fact that for a zero diameter the log 7 calculated by putting 
D = 0 in equation (7) is smaller. From equation (7) it follows that, 
when V is increased, D® will also increase but that a volume is ulti- 
mately reached above which D? will decrease. When d = 10 T. U.’s 
and log r exceeds — 1.238 or when d = 1 T. U. and log 7 exceeds — 2.238, 
we would expect the D’ for 0.1 ml. to exceed that for 0.2 ml. The 
expectation is supported by subjects 1, 2, 3 and 4. For subjects 1 
and 2 at 10 T. U.’s and subject 4 at 1 T. U., the effect of increasing 
V is to dilute the tuberculin so much that at no point in the entire 
area of the reaction is the Minimum Tuberculin Concentration reached 
and no induration develops. For subjects 5-16 this dilution effect is 
more than offset by the increased spread of the fluid which results in 
tuberculin being carried further from the site of injection at more than 
the necessary minimum concentration. 


3. Antibody level related to (4) Minimum Tuberculin Concentration and 
(it) bacillary stimulation. 


(i) Let ¢(7) be the areal density of tuberculin at a particular point 
at time 7 after deposition. The rate of escape of tuberculin into the 
circulatory system without reaction with antibody will be [¢’(T)]z = 
—é(T) where ¢ is a constant for a given subject (Kety [1949]). If \ 
is the antibody density, the rate of accumulation of reaction products 
will be r’(T) = dé(T) and the rate of reaction of tuberculin will be 
[t’(T)]e = —wr'(T') where w is a constant. Therefore 

UT) = [#(T)Je + (TD) = —(e + od) e(T) 
iT) = 40) exp [—(e + wd)T] 

r(T) = M(0){1 — exp [—(€ + wd)T]}/(e + @d) 
r(co) = r#(O)/(e + wr). 


If induration occurs wherever r(#) > p then 


T= 7/A + 6 (8) 


where y = ep and 6 = wp. 


(ii) The multi-hit model. 


Injected (or otherwise acquired) bacilli will be distributed through- 
out the reticulo-endothelial tissues which consist of lymphocyte-producing 
centres. ‘The bacilli stimulate these centres to pour sensitised lym- 
phocytes (antibody) into the circulation. Suppose (a) the bacilli are 
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distributed randomly among a much larger number of centres, (b) at 
least y bacilli are required to sensitise a centre, (c) the resulting is 
proportional to the number of sensitised centres. If B bacilli are 
distributed among C centres and p,(B) and s,(B) are the mean propor- 
tions of centres with r and r or more bacilli respectively, then 


Cs,(B + 1) — Cs,(B) = p,-,(B) r=1,2,.:-- 
or approximately, when B is large, 
Cs;(B) = s,-,(B) — s,(B). 


Therefore 


d ; 
(c4+1)s@) =%=1 


the solution of which is 
s(By = 1—e "fl + (B/C) + --- + {1/e — DYG/Ccy”)] 
= 6 *°[/ri(B/CY + {1/¢ + I!}(/C) + ---). 


TABLE 4 
Loa Minimum TuBERCULIN CONCENTRATION (7) 
Injection- 
volume 0.1 ml. 0.2 ml. 
Average 
Dose of PPD | 10 T.U. iL ALAY). LOU AOL, log +r K 
Subject 
1 —0.76 | (—1.64) | (—0.94) | (—1.94) —0276 3960 
2 —0.83 | (—1.64) | (—0.94) | (—1.94) —(0).83 1655 
3 —1.02 | (—1.64) —1.04 (—1.94) al -(033} 1655 
4 —1.87 —1.83 —1.88 (—1.94) —1.86 1655 
5 —2.41 —2.42 —2,29 —2.41 —2).38 3960 
6 —2.41 * —2.42 —2.36 —2.41 —2.38 3960 
i —2.41 —2.42 —2.44 —2.46 —2.43 3960 
8 = Hs —2.77 —2.76 —2.82 —2.78 3960 
9 —2.78 Thi —2.85 —2.82 —2.80 2490 
10 —2.92 —2.77 —2.93 —2.95 —2.89 3960 
im 4, NP =Boddl =—3 02 Sl —2.93 3960 
12 = 8). 18) —o- 18 = 3.21 8) —3. 20 3960 
13 —3.48 —3 41 —3.40 —3.36 =o. 4 3960 
14 Satie) Bi TS) Se licoy —3.85 —3. 81 8408 
15 —4.81 —4.79 —4.73 —4.81 —4.78 8408 
16 —6.82 —6.57 —6.64 =O. =6 700 8408 
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If B/C is small, we therefore expect \ « B" or 
log = vlogB+ 7 (9) 


where 7 is a constant. The integer v provides the measure of bacillary 
virulence. 


Agreement with data. 


(i) Human Subjects. 39 female subjects, whose preliminary skin-tests 
showed no induration, were vaccinated with a range of B. C. G. doses 
and after six weeks were tested with 100 T. U.’s in 0.1 ml. The log 7’s 
calculated from equation (7) are given in Table 5. Each subject’s 
B. C. G. dose establishes a level of 4. Table 5 shows that the variance 


TABLE 5 
Errect or B.C.G. DosE on S1zE oF INDURATION 


No. of Vol. B.C.G. Induration Average 

subjects in ml. diameter (mms) log 7 log + 
2 0.300 32.0 ST =f te) 
1 0.300 32.5 = 504 
2 0.250 30.0 —6.74 —6.90 
1 0.250 31.0 —7.22 
1 0.245 28.0 —5.82 =—6.16 
il 0.245 29.5 —6.50 
1 0.155 25.5 atl —4.77 
1 0.150 24.0 —4.18 —4.47 
3 0.150 25.0 —4.57 
1 0.140 24.0 —4.18 —4.18 
1 0.130 23 .0 —3.81 =3.81 
2 0.120 22.0 —3.46 
1 0.120 22.5 —3.63 —3.59 
1 0.120 23.0 —3.81 
1 0.100 20.0 —2.79 Soe 
1 0.080 16.0 =i Ge Si 
1 0.080 17.0 il Oe 
1 0.075 14.5 =i 80 
3 0.075 15.0 a —1.41 
1 0.075 15.5 SDS 
2 0.070 14.0 == 11, 11} 11 0 
1 0.070 1555 il {33 
1 0.065 12.0 N77 =() 70 
3 0.060 10.0 —0.43 
2 0.060 10.5 —0.51 —0.51 
3 0.060 11.0 —0.59 


TUBERCULIN HYPERSENSITIVITY 43 


of Minimum Tuberculin Concentrations within dose-levels is much 
smaller than the between-dose variance. This implies that the constants 
y and 6 differ only slightly, if at all, among these subjects. If 6 is 
constant from subject to subject, it is clear from equation (8) that it 
is less than any r occurring. Therefore 6 < 1077°”. The average of 
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FIGURE 2 
Tue Errect or B.C.G. Dosz on THE Minimum TuBERCULIN CONCENTRATION 


log r* is plotted against the log B. C. G.-dose in Figure 2. The least- 
squares straight-line has the equation 


log 7’ = 10.1 log B.C.G. + 12.9. (10) 


(95 percent confidence limits for the slope are 9.8, 10.4. A significantly 
better fit is possible by allowing a quadratic term but the improve- 
ment is only slight.) It is reasonable to extrapolate and state that 
higher B. C. G. doses would result in much lower 7’s than 10™° 
T. U.’s/(mm)”. So for all subjects we will put 6 = 0 in equation (8) 


and write 
7 y/X. (11) 
Since y = ep and 6 = wp, the interpretation of 6 = 0 is that most of 


the tuberculin escapes without local reaction. 
As stated above, the variance of y from subject to subject must 
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be quite small. This is somewhat surprising since y is presumably 
related to skin-type. However, all 39 subjects were aged 18 or 19 
so that perhaps y is determined by age. The subjects of Table 4 throw 
some light on this. The relation of the area of fluid coverage constant « 
with age is.as follows: 


a 
| 


= 1655, ages over 65; 

x = 2490, age 55; 

3960, ages between 30 and 45; 
= 8408, ages under 30. 


a 
I 


a 
| 


The negative correlation is statistically significant (P = 0.05). There 
is a (non-significant) negative correlation of x and 7. Accepting both 
correlations as real, there would be two possibilities consistent with 
the theory: (i) y is independent of age but both « and \ decrease with 
age, the latter due to decreasing bacillary stimulation. In this case 
the determination of « by dye-injection has no relevance to the inter- 
pretation of the Mantoux reaction. (ii) y and x are correlated, both 
being functions of skin-type. The data considered does not distinguish 
these two. One experimental approach would be to take negative 
reactors at different ages and induce possibly identical antibody levels 
by B. C. G. vaccination. 
Equations (10) and (11) combine to give 


log \ = 10.1 log B.C.G. + 12.9 + log y. 


The confidence interval for the slope includes the value 10 so that, 
the results are consistent with equation (9) when »v = 10. 


(ii) Guinea-pigs. 


Each of five dose-levels of virulent Myc. Tub. bacilli was given 
by intraperitoneal injection to one of five pairs of guinea-pigs. The 
animals were tested with 100 T. U.’s in 0.1 ml. on the 10th, 18th, 27th 
and 35th days after injection. On one occasion each pair was also 
tested with 320 T. U.’s. The average diameters for each pair are 
given in Table 6 with the 320 T. U. results in brackets. The simul- 
taneous diameters at 100 T. U. and 320 T. U. are consistent with equa- 
tion (1) with the same value of a* as for humans 126.7. For the dif- 
ferences D329 — Dio are 56, 56, 61, 66 and 69 with an average of 62. 
By equation (1) the difference should be 64. If equations (7) and (11) 
also hold, the antibody levels, log (A/y), can be calculated. These are 
plotted in Figure 3. For these animals the bacillary content is increasing 
with time. If the increase is geometric, the actual bacillary content 
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TABLE 6 
Tue INpuRATION DiaMETERS PRopuUcED BY DirrERENT Dosus oF MycoBAcTrERIUM 
TuBERCULOsIS (H 37 Rv) in Guinra-Pias 


Dose of infecting organisms; Log mgms. 


Days wet weight of bacilli 
after 
infection = 11. TANS al e}$)3) —1.990 —2.087 —2.184 
10 9 8 5 2 0) 
18 10 9 7 5 (9) 0 
27 11 10 (12.5) 8.5 7 5 
35 12 (14.5) fi 10 (18) 8.5 TGLO5) 


on the 7th day will be B,10*” where B, is the initial dose and k is the 
growth-rate constant. If equation (9) holds, then 


log (A/y) = v log Bo + vk log T + const. 


The least-squares estimates are ? = 2.05, & = 0.0104 and const. = 3.72. 
The lines drawn in Figure 3 are based on these estimates. In view of 
the small number of observations a slope of 2.05 is consistent with a 
true value of » of 2. 


0-8 


LOG: ie. 
0-6 


0-0 O-2 O-4 


—O-2 


-0-4 


=2°2 =A —PEX®) alle) Hits} 
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Tur Revation Between ANTIBODY LeveL AND Inrecrine DosE or Myc. Tus. 
(H 37 Rv) ar 10, 18, 27 anp 35 Days Arrer INJECTION 
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If the models are valid, it must be concluded that, whereas at least 
10 B. C. G. bacilli are needed to activate a lymphocyte-producer in 
humans, only two virulent Myc. Tub. bacilli are required in guinea-pigs. 


DISCUSSION 


The conclusion of Section 1 of Results was that there exists a linear 
relation between the induration-area and the log-dose of tuberculin 
in constant volume. This contradicts the conclusion of Long and 
Miles [1950] with tuberculin and of Miles [1949] with diphtheria toxin 
that the linear relation is between diameter and log-dose. In Long 
and Miles [1950] it is stated that Wadley [1948, 1949] had found such 
a relation. However examination of Wadley’s paper shows that the 
bulk of the work established a linear relation between log-dose and the 
imerease in thickness of the induration produced by tuberculin injection 
in cattle. Miles [1949] rests the case for his linearity on visual inspection 
and analyses of variance. In his Table 1 there is a statistically signifi- 
cant departure from linearity but the associated F-value is small com- 
pared with that for the linear component. However, provided the 
range of reaction-diameters is not too large, this may also be expected 
to result from our equation (1). For example, with five doses at two- 
fold dilution intervals and the diameters ranging from 10 to 17.5 mms, 
the mean square for linearity would be several hundred times larger 
than the mean square for departures from linearity. In other words 
for this range the two theories are not clearly distinguishable. Miles 
observes, and it is apparent from his figures, that the departures from 
linearity tended to occur with small induration-diameters—the curves 
dipping more steeply to the 3-5 mm values”. He invokes a special 
argument in explanation of this. However this kind of departure is 
precisely what our equation (1) would lead us to expect. 

In this paper an attempt has been made to present a validation of 
two models without considering whether they can be justified on more 
general grounds. A fuller discussion of their clinical and immunological 
implications will be published elsewhere. 

Further experimentation is clearly needed to deal with such questions 
as the following: 

(i) Are the constants or even the form of the dose-response relation 
of the intradermal test dependent on experimental technique? 

Gi) Is the multi-hit model correct for bacilli other than BCG and 
the Mycobacterium Tuberculosis used? 


(ii) Is the approach adopted useful for diseases other than tu- 
berculosis? 
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OPTIMUM ESTIMATION OF GRADIENT DIRECTION 
IN STEEPEST ASCENT EXPERIMENTS 


SamugEL H. Brooxs anp M. Ray Mickry 
C-E-I-R, Inc. 
Los Angeles, California, U.S.A. 


Summary 


In the steepest ascent method for seeking maxima (Box and Wilson), 
the procedure involves the progression to a region of high response by 
successive inferences as to the gradient direction in sets of trials en 
route. When there is no experimental error, and the response surface 
is well approximated by a plane in any locality, then it can be seen 
that an appropriately patterned set of trials, numbering just one more 
than the number of factors, would be required to completely charac- 
terize this plane and thus the gradient direction. Were there experi- 
mental error, however, the gradient direction would be correspondingly 
uncertain. One wonders whether to expect that more ground would 
be gained by having more trials in each set at the expense of having 
fewer sets. As this problem is formulated herein, there is a strong 
conclusion that the number of trials per set should be minimal. 


I. Introduction 


The procedure of steepest ascent (Box and Wilson, [1951]) consists 
of performing a sequence of sets of trials. Each set of trials is “centered” 
at a combination of factor levels, or point in factor space. An initial 
point is selected for the center of the first set of trials. Succeeding 
center points are determined by estimating the appropriate relative 
changes among the factor levels, i.e., the direction of the change, from 
the results of the set of trials just performed and scaling the increments 
of change such that the new center point is a fixed distance, the step 
length, away from the old. This procedure is followed until a near 
stationary region is attained. 

This investigation is concerned with the allocation of effort to the 
trials in the initial stages of the search, where one is not likely to be 
close to a point of maximum response. It is assumed that the choice 
of scales for the factor levels has been decided upon, so that a distance 
is defined, and that a step length has been selected. The purpose of 
a set of trials is (and we here regard this as the sole purpose) then to 
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supply the direction of the “step” for determining the next center 
point. Box and Wilson [1951] have shown that the appropriate direction 
is that of the gradient at the new point. Consequently the purpose of 
a set of trials is to provide an estimate of the gradient direction. 

It is assumed that the response surface is well approximated by a 
plane in any locality. Then the gradient direction estimated as a result 
of a set of trials differs from that of the true gradient by some angle, 
say 0, when there is experimental error. The greater the number of 
trials in a set, the smaller @ is expected to be. Let us designate by S 
the size of step to be taken in the estimated gradient direction. As 
a result, S cos @ is the component of this step which is in the direction 
of the true gradient. If cos 6 were near to one, the gradient would be 
well estimated and the step would be well taken. If cos @ were near 
zero, there would be comparatively little change in the response to 
be expected. If cos 6 were negative, the step would be downhill. Thus, 
S cos @ may be regarded as the improvement in position in factor space 
as a result of a set of trials. 

Let us regard a trial as a unit of effort, so that for ¢ trials in a set, 
the improvement per unit of effort is (S cos 6)/t. In planning a set 
of trials it is desirable to select ¢ so as to maximize the expected value 
of the improvement per unit effort, i.e. E[(S cos 6)/t]. This is equivalent 
to selecting ¢ so as to maximize (1/t) H(cos 6), where E'(cos 6) is regarded 
as a function of ¢. 

Our main result is that the maximum is achieved when ¢ is one more 
than the number of factors; that is, it is not worthwhile to improve the 
precision of the estimate of the gradient direction at the expense of 
more trials. Since this result would be anticipated if the experimental 
variation could be ignored, it is of interest that it applies independently 
of the magnitude of the variability. The result is convenient from the 
point of view of application, since otherwise the appropriate value of 
t would depend upon the magnitude of the gradient and upon the 
experimental variability, the ratio of which would have to be estimated 
in advance of the set of trials. Consequently, the simplicity of the 
result enhances its value. 

The result is developed from the assumption that the trial design 
yields normally distributed uncorrelated estimates of the gradient com- 
ponents which have the same variance. The magnitude of the common 
variance will depend upon the design used. Although the present 
result does not evaluate the relative efficiency of alternative designs, 
the above formulation of the problem does provide a basis for com- 
parison by means of E(cos 0). While we do not pursue this line of 
investigation here, some tabular values are given, so that alternative 
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designs can be readily compared from the point of view of efficiency 
for maxima seeking. 

The basic function H(cos 6) is evaluated in Section II, and some 
numerical values are presented. The development is completed in 
Section III by applying the result of Section II to establish the main 
result of the paper. Some numerical values are presented of how the 
efficiency depends on the number of trials within a set. 

Brief mention should be made of related aspects of the maximum 
seeking problem that our formulation has not considered. The problem 
of recognizing that a near stationary region has been reached may 
present difficulties in the case of large experimental error. Another 
aspect is the effect of the curvature of the response surface both on the 
selection of the design of a set of trials and the size of the region over 
which the design is to be applied and consequently on the estimation 
of the gradient direction. Finally, we note that the prior selection of 
scales of the factor levels and step length also affect the estimate of the 
gradient direction. 


II. Derivation of E(cos @) 


Let us suppose that in the vicinity of a set of trials the relation 
between the response and the n factors is well represented by a plane: 


R= Bo + Bit + Bote + --- + Bate oe 


where x; is the level of the 7th factor and e is the experimental error,’ 
NID(O, o%). 

Let us suppose, also, that as a result of a set of ¢ trials it is possible 
to construct an estimate b; of 8; such that the b; are NID(@; , o”) and a” 
is a function of ¢. 

The angle between the true gradient 8, having components #; , 
and the estimated gradient 6, having components b, , is 6. Consequently: 


cos 6 = > b.8:/V >) vi 5B? 4where 0 < 6 er. 


E(cos 6) is to be found from the joint distribution of the magnitude 
and angle of error, @ of the random vector b. It is possible to orient 
the factor space so that, without loss of generality, the components 
of 6 may be expressed, in units of o, as p, 0,0, --- , 0. Under the same 
transformation the components of b are y, , yo, --* , Ya. The y; are 


NID with 
Eiy:) = ’ ia 
QO Otherwise 


1NID (0, o,.?) means normally and independently distributed with mean zero and variance a. 
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and each with unit variance. The magnitude of b, in units of o, is r. 


The joint distribution of r and 6 is found from the joint distribution 
of the y; : 


dF(y,) = (2G hee eee ek dy, dys eve dyn : 


Noting that the quantity in the brackets can be expressed by the 
cosine law, this function, in terms of 6 and 7, becomes: 


dF(r, 6) a (2elee ten a eo aie dé dr dH, 


where dH is the region over which r and 6 are constant; that is, the 
surface “‘area” of an (n — 1)-dimensional hypersphere of radius r sin 8. 
By integration over this region: 


Tr (sin Oye 


i Ve Gene 
ars) 


This provides an expression for the expected value of cos 0: 


dF(r, 6) = 


—i/2(r?—2rp cos 0+p?) dr dé 


E(cos 6) = i) ‘| cos 6 dF(r, 6). 
8@=0 dJr=0 
Expanding e’’ °s @ in powers of rp cos 6 leads to: 


7+1 n-2 © 
E(cos @) = se 2 a _(cos 6)'* (sin 6)" ao | ger ilea 18/2 
0 7! r(t )r (2=4 = ee 2)/2 
2 


Each of the even j terms in the above expression are zero since, 
for even 7, 


I i (cos 6)'*'(sin 6)? dé = 0. 
0 


Il 


For odd j this J integral may be put in the form of the beta function 
by letting X = sin’ 6: 


x/2 
l=2 / (cos 6)'*(sin 6)"* dé 
0 
a+ 2) (? = 1) 
r( a ue my ; 
ack ‘) 
yeh 
The r integral evaluated as a gamma function becomes: 


(n+ i-2)/2 ati). 
anran(t 


1 
= f a — xox? ax = 
0) 
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Assembling these terms and letting 7 = 2¢ + 1, there results: 


ale a 
Ms r(2 = S\parnay (a 1 i 2i+1 
E(cos 6) = Biies : > = 


oe Ee) 


(n+ 1 Te 
p —p2/2 <a 2 +aJ(2 : 


+ i)i 


~ 4/2" > eee 


It may be noted that the above expression is closely related to 
the confluent hypergeometric function. 

An approximation to E(cos @) follows directly from the expression 
given for cos @ by noting that 


E( 2) bBi) = 27 Bi = po”, 
E(>> 3) = (n + p’)o’. 
By substituting expected values for the random variables one obtains: 


E(cos) & p/Vn + p’. 


This approximation is in good agreement with the values in the table 
and illustrates the general nature of this function.” 


III. Optimum Number of Trials in a Set 


In the previous section it was stated that p is in units of o and 
that o is a function of t. (cos @) is to be expressed as a function of 
t so that (1/t)E(cos @) can be examined for the ¢ which maximizes this 
quantity; that is, the number of trials in a set which results in the 
greatest expected improvement per unit of effort. 

Since the 8; are ordinarily estimated by means of linear functions 
of the observations, then o” would ordinarily be a simple multiple of 
o2 . Were that set of trials replicated, say m times, then o” would be 
the same multiple of o2/m, which would be proportional to o%/t. Under 
these circumstances o would be proportional to 1/t. It is suggested 
that this would be a good approximation of the functional relation of 
o and ¢ for more varied experimental designs than those which are 
replicated, as long as the trials would be made in the same locality. 
Consequently, since p” is in units of o°, p is proportional to t. It is 


2It can be shown that a lower bound for E(cos @) is e/V n IG A oak 
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convenient to work with a quantity which is proportional to ¢ and 
therefore to p°. Let 7, k and h,(T) be defined by the following: 


T = kt = p’/2, 
iin vale } (i-1/2) 
| 2 r( 5 + 2)7T 
(1/)E(cos 6) = h,(T) = ke >: Esa 
na r( Sr i) 
2 
As a result of differentiating this with respect to 7: 
Th) = 
aT = yi h,(T) h,(T) =F Re) 


ie 
= etl) = ke SS 
ye 7 
M+o4 2pr(" + 


st ida 


It is apparent that for any positive n and any positive 7, ¢ or p’, 
this derivative is negative. Consequently, (1/t) E(cos 6) isa decreasing 
function of ¢, so that ¢ should be as small as possible. The minimum 
number of trials required to estimate a gradient by means of uncorrelated 
estimates of its components is one more than the number of factors. 
As a result, t = n + 1 is the “optimum” number of trials to be used 
in each set of a maximum seeking experiment when such estimation is 
to be done. 

The simplex designs are very appropriate for this kind of experi- 
mentation. These and other useful first order designs are discussed 
in the reference by Box. In those cases where the factors can be assigned 


only discrete levels, the designs given by Plackett and Burman can 
be used. 


IV. Efficiency of Replication 


It is of interest to see how efficient more than n + 1 trials in a set 
would be since other considerations may make it desirable to do as 
many trials as would be worthwhile within a single set. This would 
be the case, for example, if in some agricultural applications, many 
trials may be carried out simultaneously, but it takes an entire growing 
season to do any set of trials. 

As a notational convenience, the number of trials for which t¢ = n + 1 
is designated as a single replicate, R = 1, andt = R(n + 1) is an Rth 
replicate. Let @ be the value of p when R = 1. Thus, G is the magni- 
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tude of the gradient relative to the standard error of estimation of a 
component when the number of trials used to do this estimation is 
nm + 1. For those experimental designs for which the standard error 
of component estimation is reduced in proportion to R7 2. @.g. “pure” 
replication, there results the relation p = R'’G. The efficiency of 


TABLE 2 
Erricrency or R, tHE NUMBER OF REPLICATES, FOR n, THE NUMBER OF FAcTorRS 
AND G, THE MAGNITUDE OF THE GRADIENT RELATIVE TO THE 
StaNDARD ERRor or Estimation WHEN R = 1 


The Number of Factors is n = 2 


Relative Magnitude of the Gradient 


Replicates, R 0.25 0.5 1 2 4 oo 
1 1.000 1.000 1.000 1.000 1.000 1.000 
2 0.702 0.686 0.621 0.550 0.510 0.500 
4 0.489 0.459 0.369 0.286 0.257 0.250 
8 0.334 0.292 0.203 0.146 0.129 0.125 
16 0.224 0.174 0.106 0.073 0.065 0.0625 
The Number of Factors isn = 8 
Relative Magnitude of the Gradient, G 
Replicates, 2 0.25 0.5 1 2 4 co) 
1 1.000 1.000 1.000 1.000 1.000 1.000 
2 0.705 0.699 0.676 0.618 0.547 0.500 
4 0.495 0.482 0.411 0.358 0.287 0.250 
8 0.346 0.326 0.272 0.196 0.149 0.125 
16 0.239 0.213 0.158 0.103 0.076 0.0625 
The Number of Factors isn = 32 
Relative Magnitude of the Gradient, G 
Replicates, & 0.25 0.5 1 2 4 ©0 
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an #th replicate for particular n and G is defined as the improvement 
per unit of effort expected as a result of the Rth replicate design as 
compared with the improvement per unit of effort expected when R = 1: 


Ceo 6|n, p = VRQ) 


R 
Efficiency of (R | n,G) = — E(cos 6|n, p =@) 
REFERENCES | 


1. Box, G. E. P. [1952]. Multifactor designs of first order. Biometrika 39, 49-57. 

2. Box, G. E. P., and Wilson, K. B. [1951]. On the experimental attainment of 
optimum conditions. Jour. Roy. Stat. Soc., 13, 1-45. 

3. Brooks, 8. H. [1959]. A comparison of maximum seeking methods. Operations 
Research 7, 430-57. 

4. Plackett, R. L., and Burman, J. P. [1946]. The Design of optimum multifactorial 
experiments. Biometrika 33, 305-25. 


A STOCHASTIC STUDY OF THE LIFE TABLE 
AND ITS APPLICATIONS 


Ill. THE FOLLOW-UP STUDY WITH THE CONSIDERATION 
OF COMPETING RISKS'”? 


Cun Lone CHIANG 
University of California, Berkeley, California, U.S.A. 


INTRODUCTION 


Statistical studies falling into the general category of life testing 
and medical follow-up have as their common immediate objective the 
estimation of life expectation and survival rates for a defined population 
at risk. Usually such a study must be brought to a close before all 
the information on survival (of patients, electric bulbs, automobiles, 
etc.) is complete, and thus the study is said to be truncated. Whether 
the investigation is basically concerned with life testing or with medical 
follow-up, the nature of the problem is the same, although differences 
in sample size may call for different approaches. Thus methods devel- 
oped for life testing may be applied to follow-up studies when the under- 
lying conditions are met, and vice versa. In this study cancer survival 
data utilizing a large sample will be used as illustrative material, and 
we shall accordingly use the terminology of the medical follow-up study 
as a matter of convenience. 

We are concerned then with a typical follow-up study in which a 
group of individuals with some common morbidity experience are 
followed from a well-defined zero point, such as date of hospital admis- 
sion. Perhaps we wish to evaluate a certain therapeutic measure by 
comparing the expectation of life and survival rates of treated and 
untreated patients. Or we may wish to compare the expectation of 
life of treated and presumably cured patients with that of normal 
persons. When the period of observation is ended, there will usually 
remain a number of individuals on whom the mortality data in a typical 
study will be incomplete. Of first importance among these are the 


1This study was completed while the author was a Special Research Fellow of the National Heart 
Institute, Public Health Service, U. S. Department of Health, Education and Welfare. 

2Parts of this paper were presented at the joint meeting of the American Statistical Association, 
the Institute of Mathematical Statistics, and the Biometric Society in Washington, D. C., December 
29, 1959. 
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persons still alive at the close of the study. Secondly, if the investi- 
gation is concerned with mortality from a specific cause, the necessary 
information is incomplete and unavailable for patients who died from 
other causes. In addition, there will usually be a third group of patients 
who were “‘lost”’ to the study because of follow-up failure. These three 
groups present a number of statistical problems in the estimation of 
the expectation of life and survival rates. Many significant studies 
have been made along these lines, among them the early works of 
Greenwood [13] and Karn [17], the actuarial method of Berkson and 
Gage [1], a stochastic model of competing risks by Fix and Neyman [12], 
the parametric studies by Berkson and Gage [2] and Boag [3], the 
non-parametric approach of Kaplan and Meier [16], studies on life 
testing by Epstein and Soble [10] and other interesting works by Dorn 
[8], Elveback [9], Fix [11], Harris, Meier and Tukey [14], and Littell [18]. 

The purpose of this paper is to adapt the biometric functions of 
the life table to the special conditions of the follow-up study. Part I 
considers the general type of study in which survival experience is 
investigated without specification as to the cause of death. An exact 
formula will be presented for the maximum likelihood estimator of 
the probability of death and its asymptotic variance. Special attention 
will be given to a method for computing the observed expectation of 
life in truncated studies and the corresponding variance. In Part II 
the discussion will be extended to apply to studies of mortality from 
a specific cause in the presence of competing risks. The relations 
between net, crude, and partial crude probabilities will be reviewed 
and formulas developed for their estimators and the corresponding 
variances and covariances. In the last part of the paper, data obtained 
from the California State Department of Public Health will be used 
to illustrate the application of the theoretical matter presented in 
Parts I and II. 

Throughout this paper we shall assume that all individuals in a 
sample are subject to the same force of mortality (or, instantaneous 
death probability), and that the probability of dying for one individual 
is not influenced by the death of any other individual in the group. 
This is to say that the life-times of all individuals in a group are treated 
as independent and identically distributed random variables. We shall 
also assume in Part I that there will be no individuals lost to observation 
because of follow-up failure. The problem of lost cases will be con- 
sidered in Part II (Remark 3). 

For simplicity of presentation, a constant time interval (year) will 
be used. However, the methods developed in this paper apply equally 
well to cases where intervals are of different lengths; although the ob- 
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served expectation of life will have a slightly different form (Cf. [5] 
and [6]). 

The probability symbols used in this paper are listed below for 
convenient reference. Considering death without specification to cause: 


p. = Pr [an individual alive at time zx will survive the interval 
(x, x + 1)], 


I 


Pr [an individual alive at time zx will die in the interval 


(z, x + 1)], 


and obviously p, + g. = 1. When death is studied by cause, or risk, 
we have the net probabilities: 


Qz 


g-. = Pr [an individual alive at time x-will die in the interval 
(x, x + 1) if risk R, is the only acting risk of death in the 
population], 

gz.. = Pr [an individual alive at time x will die in the interval 
(x, x + 1) if risk R, is eliminated from the population]; 


the crude probability: 
Q., = Pr [an individual alive at time x will die from cause FR, in 


the interval (z, x + 1), in the presence of all other risks 
in the population]; 
and the partial crude probabilities: 

Q,i.1. = Pr [an individual alive at time x will die from cause FR, 
in the interval (a, x + 1), when only risk R, is eliminated 
from the population], 

Q...12 = Pr [an individual alive at time x will die from cause R, 
in the interval (z, x + 1), when risks R, and R, are elimi- 
nated from the population]. 


PART I. THE ESTIMATION OF THE PROBABILITY OF SURVIVAL 
AND EXPECTATION OF LIFE 


1.1. The basic random variables and their joint probability function. 


Consider a follow-up study conducted over a period of y years. A 
total of No individuals are accepted into the study at any time prior 
to the closing date® and are observed until death or until the study is 
terminated, whichever comes first. If we set the time of entrance into 


2Although we have used the common closing date method to illustrate the techniques developed 
in this paper, it should be pointed out that these techniques are equally applicable to the date of last 


reporting method. 
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the study as the common point of origin for all Ny individuals, then 
N, is taken to be the number with which the study began, or the number 
of individuals alive at time zero. Let x be the exact number of years 
since entrance into the study, and N, the number of individuals who 
survive to the common point xz. Clearly, N, may also be defined as 
the number of survivors who entered the study at least x years before 
its closing date. The number of survivors will decrease as x increases, 
not only because of deaths but also because of withdrawals due to the 
closing of the study. We will describe this process of depletion system- 
atically for the typical interval (x, x + 1) with reference to Table 1.* 

At time 2, the NV, survivors who begin the interval can be divided 
into two mutually exclusive groups according to their date of entrance 
into the study. A group of m patients entered the study more than 
x + 1 years before the closing date of the study. Out of these, 6 patients 
will die in the interval and s will survive to begin the next interval. 
The second group of n patients entered the study less than x + 1 
years before its termination, and hence are all counted as withdrawals 


TABLE 1 


DistrisutTion or N, Patients AccorDING TO WITHDRAWAL Stratus, SURVIVAL 
Stratus, AND CausE oF DEATH IN THE INTERVAL (x, x + 1)* 


Withdrawal status in the interval 


Survival status and Total number Number not due Number due for 
cause of death of patients for withdrawal** withdrawal*** 
Total Nz m n 
Survivors s+w s w 
Deaths, all causes D 6 € 


Cause of death 


Ry D, 61 €1 
Re Dz 62 €2 
Ie Dd. O« Ec 


*The subscript x, which should be added to each of the symbols in the table, is deleted to simplify 
formulas in the text. 


**Patients admitted to the study more than (z + 1) years before closing date. 


**Patients admitted to the study less than (x + 1) years but more than x years before closing 
date. 


4The second part of Table 1, death by cause, was included for use in Part II of this paper. 
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in the interval (x, x + 1), whether or not they survive, because for 
them the closing date precedes their (cx + 1)-anniversary date. Let 
us say that ¢ will die before the closing date and w will survive to with- 
draw alive. 

Thus s, 6, w, and ¢ are the basic random variables (upper part of 
Table 1) whose distribution depends on the force of mortality. The 
values that these random variables take on will be used to estimate the 
probability p, that a patient will survive the interval (x, x + 1), and 
its complement g, , the probability of death in the interval. The first 
step is to derive the joint probability function of these random variables. 

Let u,, a function of time 7, be the force of mortality acting on 
each individual in the study, such that 


u, Ar + o(Ar) = Pr [an individual alive at time 7 will die in the 
interval (7, 7 + Ar)], for 7 > 0, 


where Az stands for an infinitesimal time interval and o(Ar) a quantity 
of smaller order of magnitude than Ar. It can be shown [5] that 


pAt) = exp ey L, a.) 


is the probability that an individual alive at x will survive to (a + 2). 
If we assume a constant force of mortality within the interval (x, x + 1), 
say uw, = wu, depending only on z, for x < + S$ x + 1, then the proba- 
bility of surviving the interval is given by” 
ay ee Oi, 
For the subinterval (z, x + 4), with ¢ between zero and one, we have 
Di one = page lore Ot baal. 

Consider first the group of m individuals, each of whom has a constant 
probability p, of surviving and a probability g¢, = 1 — p, of dying in 
the interval (z, z + 1). We have then a typical binomial case with 
the probability function: 


fi: = pi(l — p,)’- : (1) 
The expected number of survivors and deaths are given, respectively, by 
E(s|m) = mp,, and E(é|m) = m(1 — p,). (2) 


The distribution of the random variables in the group due for with- 
drawal is not so straightforward. Making the assumption that, on 


5When the assumption of a constant force of mortality is strong for an interval where the death 
rate is high, one may subdivide the interval and estimate the probability for each subinterval separately. 
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the average, each of the m individuals will withdraw at the point x 15 oy 
the probability of withdrawing alive is equal to p!””, and the probability 
of dying before the time of withdrawal (1 — p}’’). Again we have a 


binomial case with the probability function® 


fe =p, ps _ ®&) 
The expected number of survivors and deaths are given, respectively, by 
E(w |n) = np”, and E(e|n) = n(1 — py”). (4) 


Since the N, individuals are divided at time x into two distinctly different 
groups according to their withdrawal status, the joint probability of 
all the random variables is the product of the two probability functions 
(1) and (8), 

fit =p. Ap.) Wp) (5) 


1.2. The maximum likelihood estimators and their asymptotic variances. 


We are now in a position to use the maximum likelihood principle 
to obtain the estimator of the probability p, and its complement q, , 
and the asymptotic variance. Taking the logarithm of the joint. proba- 
bility function (5), we have the likelihood function 


L = (s+ 3w) Inp, + 6In(1 — p,) +e ln (1 — pi”). (6) 


Differentiating (6) with respect to p, and setting the derivative equal 
to zero give the likelihood equation 


(s+ dup.’ — 84 — 6) — S977 — pi) = 0, 
which implies 
(w, ay a) aos 8 he) 0, (7) 


a quadratic equation in p17. Since #'/? cannot take on negative values, 


we have the estimators, 


Ie E + Vie + 4(V, — Ins + bo | 
Dees aN, = 3h) 


(8) 


6A more plausible assumption perhaps is that a withdrawal takes place randomly throughout the 
interval. However, under this assumption the probability of withdrawing alive is 


zr1 
{ e-(~2)s dr = —(1 — pz)/In pz, 


z 
and the resulting maximum likelihood equation is too unwieldy. When the probability of survival is 
not too low, the above expression and pz are close to each other. For pe = .70, for example, pz? = .837 
and —(1 — pz)/In pz = .840, In the case of extremely high mortality, one should subdivide the 
interval, 
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and 


ieee E Sky Ue (9) 
The maximum likelihood estimator (8) is not unbiased; however, it is 
consistent in the sense that when the random variables s, w, and « are 
replaced with their respective expectations as given by (2) and (4), 
the resulting expression is identical with the probability p,. That is, 


Bes — pt) + VW — ph)? + 4(N, — In\(mp,+ 4np) |? 
ZN, = 3n) 5 


Pe = 


To derive the formula for the asymptotic variance of the estimator 
p. , (or g.), we find the expectation of the second derivative of (6): 


aL mM. 
a(S) . -| Me cs “| (10) 
where . 
M,=m+n(1+p.”"y" (11) 
and 
"cas (1 — pi”). (12) 


i eas 
According to the theorem on the asymptotic efficiency of an estimator, 
the asymptotic variance of #, (or g,) 1s given by the negative inverse 
of the expectation (10), 


o3, = 1/[M./(p.92) + 7]. (13) 


Usually the quantity 7 in the denominator of (13) will be small in 
comparison with the preceding term, and may be neglected to give the 
approximate formula 


of, = 7.9:/M. . (14) 


The sample variance of p, (or g.) is obtained by substituting (8) and 
(9) in (18) or (14). 


Remark 1: The problem discussed here relates to the study in 
which N, is large. If N, is small, one may use the exact time of death 
of each of the D patients and the exact time of withdrawal of each of 
the w patients to estimate the probability p,. In this case, there will 
be N, individual observations within the interval (x, x + 1), and ob- 
viously it is unnecessary to consider the N, patients as two distinct 
eroups according to their withdrawal status. Let t; < 1 be the time 
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of death within the interval (7, x + 1) of the z-th death, forz7 = 1, --- , D, 
with a probability 


ey dt;, for i=1,---, D; 


let T; < 1 be the time of withdrawal alive of the j-th withdrawal, for 
j = 1, ++: , w, with a probability 

efor gis Ly ets 
then the joint probability function of all the N, observations becomes 


wo 


D 
(s+ Eti+ ET;) D 
pO a 


D w 
p: Teh (Case dt;) GE Cae = In Dz) Dz IT dt, : 
j=1 j=1 i= 
Maximizing the last expression with respect to p, gives the maximum- 
likelhood estimator (ef. [18]), 


p. = exp | —p/(s+ a+ Hr) |. 


1.3. Observed expectation of life. 


A life table for the follow-up subjects can be readily constructed, 
once , and @, have been determined from (8) and (9) for each interval 
of the study period. Let an arbitrary number J, denote the number 
of patients admitted to the study. The number J, who survive to the 
exact time x is computed from the formula 1, = pop: --+ p.-1 , and 
L./lo = Poh: +++ Pr-1 is the estimated x-year survival rate. For a patient 
alive at time x, the observed expectation of life can be expressed by 
the equation: 


é, = a, + Calls + CeesD Oe; + es 
+ CDN ae Dens sates obese 104 lea ez) De ae as ees) (15) 


where a, is the average time lived in the interval (x, « + 1) by the 
patients who die in that interval, and ¢, =.) -=-qe, Gere lh inse 
study covering a period of y years, there are no survivors remaining 
from the patients who entered the study in its first year, #,_, will 
be zero, and é, can be computed readily from the collected data. In 
the typical study, however, there will be W,-, Survivors who entered 
the study in its first year and withdraw alive in the final interval 
(y — 1, y). In such cases, it is evident from (8) that Py-1 18 greater 
than zero and the values of #, , B,4; , ‘++ are not observed within the 


limits of the study. Consequently, é, cannot be computed from equation 
(15): 
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It is nevertheless possible to estimate é, with a certain degree of 
accuracy if w,_; is small. Suppose we rewrite equation (15) in the 
form 


A 


Cz —— a, + Coe at CoreP Discs = oe i == Cy Davi vere Dye 
i A rN 
+ L Ciaps a Cy+2PyPu+1 air mae -), (16) 


where /,/J, is written for $,,., «++ By-:. The problem is to estimate 
the values of #, , B,.: , +: in the last term, since the preceding terms 
can be computed from the data available. 

‘As a first approach, consider a typical interval (z, 2 + 1) beyond 
time y with the probability of surviving the interval: 


zi 
p. = exp (—| adr) , for 2-= yy 1; tes. 


If the force of mortality is constant for z > y, the probability of survival 
is independent of z and we may write 


Pane =p, for 2 =), Ui ly ee 


Under this assumption c, = 1, and we may replace the last term of 
(16) with (1,/l,) (6 + #° + ---), which converges to (1,/1,)p/(l — #). 
As a result, we have 


A 


¢, = a, + idaary | + C400 Doe + aered 
A A A ie A A 
+ Cyp-Berr *** By-a + 7 [B/C — BI. 


Clearly, # may be set equal to #,-_, if the force of mortality is assumed 
to be constant beginning with time (y — 1) instead of time y. From 
the point of view of sample variation, however, it is desirable to base 
the estimate # on as large a sample size as possible. Suppose there 
exists a time 7’, for 7’ < y, such that fr , Prs1, -++ are approximately 
equal, thus indicating a constant force of mortality after time 7. Then 
# may be set equal to #7 , and the formula for the observed expectation 


of life becomes (cf. [4]): 
6, = Gz 1 CrshPs te Cztapspavt “Cis: 

+ eben ++ Be +7 lor/ =P], 7 
forz = 0,:::,y — 1. Whena, is approximated with 3, c, = 1. 


Although formula (17) holds for « = 0, --- , y — 1, it will be ap- 
parent to the reader that the smaller the x, the larger the value of J, , 
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and the smaller will be the contribution of the last term. If the ratio 
1,/l, is small, the error in assuming a constant force of mortality beyond 
y and in the choice of #7 will have but little effect on the value of é, . 


1.4. The sample variance of the observed expectation of life. 


To avoid confusion in notation let us denote by a a fixed number 
and consider the observed expectation of life é, as given in formula (17). 
It was proven in [5] that the estimated probabilities of surviving any 
two non-overlapping intervals have a zero covariance, and hence the 
sample variance of the observed expectation of life may be computed 
from 

Sia = yy [0¢,/0p.]'S3, ’ (18) 
where the derivatives are taken at the observed point p, , for x > a. 
Tn the present case, we have 


(oe. /0p.| = Pale -—- dU —a@)], for 2 = 7, (19) 
and 
[0¢./Opr] = Parlére: + (1 — ar) + {hry/(1 —pr)°}], for a < T, (20) 
= Pe/(l — fr), for @ >, 
where Pez = Pafar *** Ps-1 - Substituting (19) and (20) in (18) 


gives the sample variance of é 


a) 


y-1 


Si. = DO Pesléser: + (1 — a,)]°S}. 


z=a 
z#T 


+ parler: + (1 — az) + (Brv/( — 2)}7S3, (21) 
for a < T, and 
Al 
See = > Dalen ae (1 ‘aa ales. 
{Bas/ l= Br) Se, for ant Bt) 
The value of #, and the sample variance of g, are obtained from formulas 
(8) and (1 3), respectively. When a, is approximated with 3, the quantity 
(1 — a,) in formulas (21) and (21a) may be replaced by 3. 
PART ITI. CONSIDERATION OF COMPETING RISKS 
2.1. Relations between net, crude, and partial crude probabilities. 


, In a follow-up study, as in general mortality analysis, one may be 
interested in death due to a specific cause, or to a group of causes. 
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Depending upon the questions to be answered, the investigator may 
explore three general types of probabilities of death with respect to 
a specific cause, or risk: 


1. The crude probability. The probability of death from a specific 
cause in the presence of all other risks in a population. 

2. The net probability. The probability of death if a specific cause 
were the only cause in effect in the population or, conversely, 
the probability of death if a specific risk were eliminated from 
the population. 

3. The partial crude probability. The probability of death from a 
specific cause in the presence of all other risks but with a second 
risk eliminated from the population. 


Obviously, in the human population, the net and partial crude proba- 
bilities usually cannot be estimated directly except through their 
relations with the crude probability. The study of such relations is 
part of the problem of ‘“‘competing risks”, or ‘“‘multiple-decrement”’. 
The subject has been variously discussed in the literature (see, for 
example, [12], [15], and [19]) and will be reviewed here only by way of 
introducing notation. 

Assume c risks of death (or causes) acting simultaneously on each 
individual of a population (that is, competing for the life of the indi- 
vidual), and let these risks be denoted by R, , -:: , R, . For each 
risk there is a corresponding force of mortality, »,, , +++ , vr. , each 
of which is a function of time 7, and the sum of these 


ee Ry et Pe (22) 


is then the total force of mortality. Within the time interval (x, x + 1), 
we shall assume a constant force of mortality for each risk, say 
Ver = V2» , depending only on x and k, fore <7 < 2+ 1. For all 
risks, we Nave uw, == fp, p10r t= 7 Sa - 

Let Q,,(£) be the crude probability that an individual alive at time 
x will die in the interval (z, x + #), for 0 < ¢ < 1, from cause RF, in 
the presence of all other risks. It follows directly from addition and 

multiplication theorems that 


Ogi = / pe idan pore Ot <i 1s. thesis es (28) 


The first factor of the integrand is the probability of surviving from 
x to 7 when all risks of death are acting, while the second factor is 
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the instantaneous probability of death from cause R,. Integrating (23) 
gives 


Q.() == fl —e™] = = [1 — pd], 
Mz Bz 


for, (Ol 555915 Wise UF bie eG (24) 


It is clear from (22) that the sum of the crude probabilities in (24) 
is equal to the complement of p,(¢), or 


Qialt)  s*+ = O..0) + pl = Li ton eae (25) 


For ¢ = 1, we shall abbreviate Q,,(1) to Q,, , etc. For the purpose of 
this study we are particularly interested in the subinterval (x, x + 4), 
with 


Q.(3) = =e [Ee a Oar Ae De je tO He iane ta rae ala 


zc 


In this case the sum of the crude probabilities (26) is the complement 
of p.’’, the probability of surviving half the interval, and 


Qiilhspeps oeaes* Se OR NSE pial wer pee eels (27) 


When risk R, acts alone, the net probability that an individual 
alive at time x will die in the interval (z, x + 1) is 


Vek = 1 pe Guus As 1 — [Orn ees. (28) 


From formulas (24) and (28) we obtain the relation between the net 
and crude probabilities 


Gan = 1, = OO or si ee, (29) 
By analogy we can write the net probability of death in the interval 
(x, x + 1) when risk R;,, is eliminated, 


(az—-Qzk)/ae 
d 


Ger =1—D, fOr? +h = ere tCe (30) 


Now suppose that R, is eliminated as a cause of death, and let 
Q.x.1 be the partial crude probability that an individual alive at time 
x will die in the interval (x, x + 1) from cause R, in the presence of all 
other risks, for k = 2, --- , c. Using a similar reasoning as in the 
crude probability [eq. (23)], we can write 


el 
—(t=—2) (H2e—-Y 21) 
(a =a | é Fine Vik dr 
x 


= A 25 [1 wi pata ieae for k= Ze eee Os (31) 
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Similarly, if risks R, and R, are eliminated, the partial crude proba- 
bility that an individual alive at x will die in the interval Gas =nl) 
from cause FR, in the presence of all other causes is given by 


Qi i2 = Saat aac Dp eI form. @ 2, (32) 


A detailed discussion on the partial crude probabilities is given in [7]. 


2.2. The basic random variables and their joint probability function. 


The identification of the random variables in a follow-up study 
in the presence of competing risks and the derivation of their joint 
probability function follows directly from the discussion in Section 1 
of Part I. The deaths in each of the two groups according to withdrawal 
status are further divided by cause of death as shown in Table 1, Part I. 

Each of the m individuals not due for withdrawal in the interval 
(x, x + 1) will fall into one of the c + 1 mutually exclusive groups, 
depending upon whether he survives the interval or dies from cause 
R, , -:: , R. , with the sum of the corresponding probabilities p, , 
Q.1,°°° , Q:- , equal to unity [eq. (25)]. Thus we have a multinomial 
case with the probability function 


f= BQ = O25 (33) 
where s is the number of survivors and 6, is the number of deaths from 
cause R, , for k = 1, :-- , c. Their mathematical expectations are 
given respectively, by 


EG | m) = mp,., and E(i-\n)\= mQ,,, for k= 1) -*+, 6. (C64) 


Each individual in the group of n due for withdrawal in the interval 
(x, x + 1) has the probability p?° of withdrawing alive and the proba- 
bility 

Qu(3) = Qu(l +p)", for b= 1, ++, ¢, (26) 


of dying from cause R,. Since p}”” and the probabilities in (26) again 


add up to unity [eq. (27)], the m observations also constitute a multi- 
nomial case with the probability function (cf. footnote 6) 


c 


pp Qed atepe ye ss (35) 


k=1 


-where w is the number of individuals withdrawing alive and e, the 
number of individuals who die from cause R, before the time of with- 
drawal. The mathematical expectations are, respectively, 


E(w | n) = npy?, and Ele |n) = nQn(1 + p””)™, 
fOMawe= resin o. (36) 
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Because of the separation of the individuals into two distinct groups 
at time x according to their withdrawal status, the joint probability of 
all the random variables in Table 1 is the product of the two joint 
probabilities (33) and (35): 


fife = pit?” I git iB Qea(l + phy], (37) 


Formula (37) may be simplified by rearranging terms and using the 
relations, D. = 6, + «& ,ande = ¢, + --- + €,, to give the final form 
of the joint probability function 


fils 0s ep ees la Ons (38) 


2.3. Maximum-likelthood estumators of crude, net, and partial crude 
probabilities. 


We will again use the maximum likelihood principle to obtain the 
estimators of the probabilities: p, , Q.1 , --- , Q,- . In this case the 
likelihood function obtained from (88) is 


L = (s+ 4w) Inp, — eln (1 + py”) 


+Din(i-p.- Dea)+ EDInes, G9) 


where the substitution 


Q.=1-—97,— >) OQ: (40) 

k=2 
has been made. Differentiating the likelihood function (39) with respect 
to pz , V2, **+ , Q:- , respectively, and setting the derivatives equal 


to zero, we obtain a system of c simultaneous equations: 
oL/ap. = [(s + 3w)/p.] — ¢/[26:° + pY%)] — (D,/O.) = 0, (41a) 
AL/8Q.. = (Dr/Qsx) — (Ds/Qn) = 0, for k= 2,--+,c.  (41b) 
From (41b) it can be deduced that 
D./Q. = D/1 — 7p), for k= 1,---,¢, (42) 


and therefore the ratio D,/Q.; in equation (41a) can be replaced with 
D/(1 — #.). When this substitution is made and the terms in (41a) 
are rearranged we have a quadratic equation in #}” that is identical 
to equation (7) in Part I. Hence the estimators #, , in (8), 

and q, for all causes of death will have the same value as in the simple 
case where death is investigated without specification to cause, as one 
would anticipate. Substituting (8) in (42) gives the estimators 
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U5 =. (Del Dd for bel esse (43) 


To obtain the estimators of the net and partial crude probabilities, 
we substitute (43) in formula (29), (80), (31), and (32), and after 
simplification, 


fee oe fore il ea (44) 
G3 = tape, for. bel, +0 yc; (45) 


A 


Onn = 1D — Dot = po |). fom 2, «se  - (46) 


and 


Oa = (D/D =D; — DI pe 2) for R= 3, 2+, 02 (47) 
The estimators given in formulas (44), (45), (46), and (47) are also 
maximum likelihood estimators because of the invariance property of 
maximum likelihood estimators. 


Remark 2: If there were no withdrawals in the interval (x, x + 1), 
i.e., if 2 = O, the problem is reduced to the classical multiple-decrement 
problem, with s survivors and D, = 6, deaths from cause R, , for 
k = 1, +--+ ,¢. These random variables will still have a multinomial 
distribution [eq. (33)], and the formulas for the estimators of p, , @. , 
and Q,, are reduced to 


p. = 8/N. , (8a) 
Ce, (9a) 

and 
Q.= D,/N,, for k=1,-:-,c. (43a) 


Formulas (44) through (47) may still be used for the estimators of 
the net and partial crude probabilities, but with #, given by (8a). 


Remark 3: The problem of cases lost to the study due to failure 
of follow-up is still unsolved, and perhaps it has no unique solution. 
Since the probability that a patient will be lost to follow-up is in part 
dependent upon the type of a study, assumptions with respect to lost 
cases may be valid for one study but not for another. If the number 
of lost cases is small. depending upon the type of study, one of the 
following assumptions may be made and the data handled accordingly: 
(1) patients lost will have the same probability of surviving as patients 
not lost, and may be deleted from the study; (2) all lost cases survive 
to the close of the study; (3) all die at the time of becoming lost; and 
(4) becoming lost is another competing risk. If sufficient knowledge 
of follow-up is unavailable, the fourth alternative is preferred. 
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2.4. Asymptotic variance and covariance of the estimators. 


Formulas for the variance and covariance of the estimators may 
be determined by using the asymptotic property of maximum-likeli- 
hood estimators. The inverse of the asymptotic covariance matrix 


of the estimators, #, , Q.2,°*: , Q:- , is given by 
a Ewe | aL 
132) || | || Gen) | 
OD; | 2 Op, OQ. 
na i<c— ) 
eae nr eed eee cet |-------------- (48) 
z fE_)|" || af fo) 
Oy 8Qa/ || 1 EON dOn 
i (Hyp Se1 Oe RSE Seta) 


in which the elements are obtained by differentiating formula (41a) 
and (41b). Direct calculation gives the following mathematical ex- 
pectations 


—E(a°L/dp:) = M,[(1/p.) + (1/Qu)] + 2, (49) 

—E(O’L/dp. Qn) = M./Qn, for k = 2,-+-,c, (50) 

—E(0°L/0Q., 0Q..) = M./Q. » tor “hist kW sk a2. smi eoe (51) 
and 

—E@"L/8Q2,) = M.[0/Qa) + (1/Qn)], for k= 2,-+-,c, (52) 


where MW, and zm are defined by equation (11) and (12), respectively, 
of Part I. Substituting the respective expectations into (48), we have 


u+b)te) MQ, Mila On 
ato.” an Cag aire aaa ies 

oS fe ena alban Bl Neem 
M,/Qu 2 Kiet ware u(2+2) 
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with its determinant 


A = (M2/Qa +++ Q2Pz) + r(ME"/Qan +++ Qe) — p.). (4) 


Denoting the cofactors of the determinant by A,, , for h, k = 1, --- ,c¢, 
the formulas for the asymptotic variance and covariance are given by 
Ose = An ‘A ae (p.q-/M,) [1/{1 =e (rp.q2/M,)}], (55) 


1 + {rp.g./M,} 

fOPI Pep as, + <6, (56) 
C32.6n = Au/A = —(p.Qs/M.)[1/{1 + (rp.g./M.)}], 

POPS he oer Cs (57) 


Coe =a. ee Bi ieee (Q..(1 pe Q.x) ‘M,] E ae iwp,( Gz =e Q,+) /M (1 im aa) 


and 


Ober.602 = An/A = —QnQu/M.)[{1 + (rp./M,)}/{1 + (xp.9./M.)}], 
1On BRE. ON el fe ee ORC (58) 


Since the term Q,, was not explicitly included in the likelihood function 
(39), the formulas for the variance of Q,, and the covariances between 
Q., and other estimators were not presented. It is obvious by reason 
of symmetry, however, that expressions for Q,, do start from Q,, , 
which is to say that formulas (56), (57), and (58) hold also for k = 1. 

The quantities inside the square brackets in formulas (55) through 
(58) may be approximated with unity when M, is moderately large. 
These formulas then reduce to familiar expressions of the multinomial 
case: 


o3, = p.9:/M. , (59) 
og Oy te a Maen tonibh cl; 2 2sipe, (60) 
OR Oi = — Def M ) for k= ibs Pee Gs, (61) 


and 
OOzh,Qrk — —0,Q2+/M, ’ for h#A k; h, k= I 2, ese Czy (62) 


Formulas for the asymptotic variance and covariance of the esti- 
mators of the net and partial crude probabilities can be obtained with 
the same approach as employed in [7]. To save space, only two formulas 
are presented below. 


o2,., = (1 — 42.4) /Mip.4e] 
-[p, In (ea )iln (ie xp irr (i OE alls (63) 
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for bk = 1, <--> , ¢-for the net probability of death when risk R, is 
eliminated, and 


Cosh. = [(gz — Q1 — Q.x)/{M,(q- — OOO: 
+ [{Qrr.1(Ge — Qer) — Q.n}°/{M.p.9-(92 — Q.1)}] 


"[(@. — Qe1) + Q.rp-(In p./¢.)), (64) 
fork = 2, --- , ¢, for the partial crude probability. 


AN EXAMPLE OF LIFE TABLE CONSTRUCTION 
FOR THE FOLLOW-UP POPULATION 


The application of the methods developed in Parts I and II will 
be illustrated with data collected by the Tumor Registry of the Cali- 
fornia State Department of Public Health. The material selected 
consists of 5982 patients’ admitted to certain California hospitals and 
clinics between January 1, 1942, and December 31, 1954, with a diag- 
nosis of cancer of the cervix uteri. For the purpose of this illustration, 
the latter date is taken as the common closing date of the study; the 
date of entrance to follow-up for each patient is the date of hospital 
admission. 

The first step is to construct a table similar to Table 2, showing the 
survival experience of the patients grouped according to their with- 
drawal status. The interval length selected (column 1) will depend 
upon the nature of the investigation: in this case a fixed length of one 
year was convenient and satisfactory. The total number of patients 
admitted to the study is entered as N, in the first line of column 2, 
which in this example is 5982. To determine their withdrawal status 
in the first interval (0, 1) the patients were separated into two groups: 
admissions before 1954, and consequently at least one year before the 
close of the study; and admissions during the year 1954, all due for 
withdrawal since the study was terminated before their first anniversary. 
Of the patients admitted prior to 1954, so (4030 in column 3) survived 
to their first anniversary and 6, (1287) died during the first year. The 
deaths were further divided by cause into 6), (1105) deaths due to 
cancer of the cervix uteri and 6,, (182) deaths from all other causes. 
The survival status of the 1954 admissions is determined at the close 
of the study, as it is for patients due for withdrawal in any interval. 
In this study, wo (576) patients withdrew alive in the first interval, 
and ¢€ (89) patients died before the closing date. These deaths were 
again divided by cause into 5; (70, column 9) and ¢5 (19, column 10). 


7An additional 251 cases of uncertain survival status were deleted from this illustration. 
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TABLE 3 


SurvivaL Experience ArTeR DIAGNOSIS OF CANCER OF THE CpRvIx UTERI: 
Tue Main Lire TaBLe FUNCTIONS AND THEIR STANDARD ERRORS 


Interval 
since Estimated probability Observed 
diagnosis x-year survival rate of death in interval expectation 
(years) Doz (oa) of life at x 
z—-z+1 1000 poz 1000 S,,, 1000 Gr 1000 S3, éz S;, 
(1) (2) (3) (4) (5) (6) (7) 
0-1 1000.00 0.00 242.54 5.69 12.90 2.83 
1-2 757.46 5.80 181.43 6.26 15.86 3.74 
2-3 620.03 6.65 103.03 5.95 18.27 4.57 
3-4 556.15 AD 85.76 6.38 19.31 5.09 
4-5 508.46 7.33 64.13 6.50 20.08 | 5.56 
5-6 475.85 7.61 58.20 Wa2e 20.42 5.94 
6-7 448.15 7.95 43.76 7.34 20.65 6.31 
7-8 428.54 8.29 43.20 8.45 20.57 6.60 
8-9 410.03 8.71 33.69 8.85 20.48 6.89 
9-10 396. 22 9.17 46.55 PA 115) 20.17 7213 
10-11 Silat 9.98 43.85 14.30 20.13 7.47 
11-12 361.21 10.97 51.06 20.30 20.03 1th 
12-13 342.77 12.73 00.00 00.00 20.08 | 7.79 
13 342.77 12.73 — — 19.08 7.79 


Source: California Tumor Registry, Department of Public Health, State of California. 


The second interval began with the 4030 survivors from the first interval, 
which is entered as N, in line 2 of column 2. All 1953 admissions in- 
cluded in N, were due for withdrawal in the second interval. 

The main life table functions and the corresponding sample standard 
errors as shown in Table 3 are determined from the data given in Table 2. 
The x-year survival rate fo, is by definition equal to 1, divided by the 
radix ly , or Pop: ; --+ , Pz-1 . The sample variance of Poze IS computed 
from a formula given in a previous publication [6] (see also [13]): 


z-1 
2 A? A—-2 Q2 
Sic r. Poz Seis Si . 
u=0 


Formulas (18) and (14) were both used to compute the sample standard 
error of @, , with numerical results that were almost identical to the 
fourth decimal place. The figures appearing in column 5 of Table 3 
were obtained by formula (14). The observed expectation of life was 
determined from formula (17), for which #7 was set equal to pi, . 
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TABLE 4 
SuRVIVAL ExpERIENcE AFTER DiaGnosis or CANCER oF THE CERVIX UTERI: 
EstIMaTeD CrupE AND Nxt PRosaBiLities oF Deatu From Cancer 
OF THE CERVIX UTERI AND FROM OTHER CAUSES 


Estimated crude Estimated net 
probabilities of death probabilities of death 
in interval in interval 
Estimated (z, x + 1) from (x, + 1) when 
Interval probability — | — 
since of surviving Cervix Cervix 
diagnosis interval Cervix Other cancer cancer 
(years) (z; a =+- 1) cancer causes | Acting alone Eliminated 

z-@+1 1000 fz 1000 Qz: | 1000 Q:2 1000 Gr 1000 gr» 

(1) (2) (3) (4) (5) (6) 
0-1 757.46 207.11 35.43 APA, 39.77 
1-2 818.57 155.97 25.46 158.11 Ziv iel 
2-3 896.97 84.65 18.38 85.46 19.22 

3-4 914.24 62.89 22.87 63 .63 23.6 
4-5 835.87 44.40 19.73 44.85 20.19 
5-6 941.80 25.76 32.44 26.19 32.87 
6-7 956.24 PAT 20.59 23.41 20.84 
7-8 956.80 22.47 20.73 22.70 20.97 
8-9 966.31 14.44 19.25 14.58 19.39 
9-10 953.45 29.93 16.62 30.18 16.88 
10-11 956.15 24.36 19.49 24.60 19.73 
11-12 948.94 17.02 34.04 17.32 34.34 

12-13 1000 .00 — _— == a 


Source: California Tumor Registry, Department of Public Health, State of California. 


Table 4 shows the estimated probability of surviving each interval 
and the estimated crude and net probabilities of death from R, , cancer 
of the cervix uteri and R, , all other causes of death. Since only two 
risks are studied, the probability g,. is equal to g,., , the net proba- 
bility of death when cancer of the cervix uteri is eliminated as a risk 
of death from the population. For each interval the sum of #, , Qa , 
and Q,» is unity, and the estimated net probability Q,, is always greater 
than the corresponding crude probability Q., . 
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THE SPEARMAN ESTIMATOR FOR 
SERIAL DILUTION ASSAYS’ 


EuGENE A. JOHNSON AND Byron Wm. Brown, Jr. 


School of Public Health, University of Minnesota, 
Minneapolis 14, Minnesota, U.S.A. 


SUMMARY 


A new estimation procedure is presented for the estimation of the 
density of organisms in a suspension by a serial dilution assay. The 
point estimator is based on the Spearman technique, is simply and 
quickly computed without special tables, and has efficiency 88 percent. 


1. INTRODUCTION 


To estimate the density of a specific organism in a suspension, a 
common method is to form a series of dilutions of the original sus- 
pension. Then from each dilution a specified volume, hereafter referred 
to as a dose, is placed in each of several tubes. Later the tubes are 
examined for evidence of growth of the organism. 

Suppose k + 1 dilutions are used with concentrations of z; = a ‘Zo , 
a= 0, +1, +2, --- , +k. 2 is the highest concentration of original 
suspension used and a > 1 is the dilution factor. 

The following notation will be used: 


xz = Ing, 
n = the number of tubes receiving a unit volume (dose) for each 
dilution, 


T; the number of tubes showing signs of growth at dilution z; , 
De = Ti n,.8; =n 12,07 = 1 — Dp, . 


I 


The usual model specifies a density of organisms of @ per unit volume 
in the original suspension and a Poisson distribution of organisms in 
the individual doses. Thus the probability of signs of growth (ie., 
the probability of one or more organisms) in a tube receiving a dose 
at concentration z; is 

P@) =1—e°"". (1) 


The maximum likelihood estimator (frequently called ‘‘most prob- 
able number”) has been proposed [5] for estimating 6. Computational 


1Research supported, in part, by a National Institute of Health Research Training Grant. 
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procedures and tables have been published to eliminate the tedious 
computations involved [1, 4, 7]. Cochran [2] has given some guides 
for designing serial dilution assays using the most probable number. 

Fisher [5] proposed another estimator for 6. Computation is tedious 
for this estimator also. Tables have been published [6] for some of 
the common experimental designs. 


2. POINT AND INTERVAL ESTIMATION PROCEDURES 


The dose levels on the log scale, z = In z, are spaced d = In a units 
apart. The probability of a growth response at dose level x is 


F(z) =1—e"*. (2) 


The equidistant spacing and the “dose-response” function F(x) with 
the characteristics of a distribution function (increasing from 0 to 1 
with 2) are typical of the common bioassay problem in which the object 
is to estimate the median (L.D. 50) or mean (uw) or F(x). For such 
situations, Spearman [9] proposed the following estimator for u: 


p=atS—-d Dp, (3) 


where x) = In 2 , d = Ina, the summation is from2z = 0 to 72 = k, and 
p; is the proportion of tubes showing growth at dilution z; . F(x) 
has mean: 


fe) 


mM -/ vbere ** dx 


/ (In y — In de” dy, 


= — 7 —In@, 
where y = .57722 is Euler’s constant [8]. The parameter of interest 6 
can then be written: 
0 =¢ ‘e* (4) 


and the point estimator 6 of @ based on @ is” 
6 = or en = Poe EET (5) 
The point estimator of the number of organisms per z) volume is 


on6 re a aaae 6) 


2The point estimator 6 is biased. Its bias is discussed in Section 4 and a correction factor is given, 
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For convenience, the necessity of a logarithm table can be circumvented 
by constructing a graph of 26 as a function of >°p; , for commonly 
used values of a. Such a graph has been constructed (Graph 1) and 
its use is illustrated in the example in Section 6. 

If an interval estimate of @ is desired, the interval estimate of u 
is computed first and then the end points transformed by equation (5). 
The interval estimator of » (confidence coefficient of .95) is given in 
expression (7). The justification is presented in Section 3. 


eee 
Peds 1.964/— mart cls 96/2 ne (7) 


The corresponding interval estimator (C.C. = .95) for @ is 


: In 2 P 
6 exp Gomes ) <6 < bexp (1. Oo eet 2 2), (8) 


where 6 is given in equation (5). 


3. CHARACTERISTICS OF &@ 


First the values of d, n and x, will be assumed fixed. Then the 
case of z) randomly chosen will be discussed. In all cases it is assumed 
that the dose levels x; = In z; span an interval wide enough so that 
F(a.) > .99 and F(x, — kd) < .O1. 

The following notation will be used to differentiate between discus- 
sions with 2, fixed and discussions with x) random: 


E, denotes expectation with respect to the random variables p; 
for fixed 2p . 

E,, denotes expectation with respect to the distribution of 2 when 
Xp» is randomly chosen. 

E denotes expectation with respect to both 2) and the random 
variables p; . 


It follows from (2) and (3) that the expected value of @ for fixed 2 is 
d 
E,(f | 20) =% +5 -—4 Di F(a) 
d Zo—1d 
=m+5—4 > (lb —epl—te )). 0) 


Using (4) and (9), the bias of @ for fixed x , B(@ | x»), is 
B(a | to) = E,(A | to) — u 
= 2% + —d > (1 — exp [—#"""]) +y¥ + In 8. (10) 
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B(f | zo) does not depend on n. It does depend on d and on x. For 
a given value of d, the bias will have a maximum and a minimum over 
all possible positions of x, , subject to the conditions that F(a) > .99 
and F(a) — kd) < .01. Let B(fi | xo) be differentiated with respect to 2 : 


e [B(@ | 2.)] = 1 — d >> O exp (a — id) exp (— 6°“). (11) 


For a given d, if enough terms are carried to cover the desired range, 
it can be shown that (11) has two zeros with respect to x» on every 
interval of length d. These solutions yield the maximum and minimum 
values of the bias over all positions 2, for fixed d. Some results, obtained 
numerically, are: 


lA 


ia Ae d = In-4: Basle) |S 0019 
i= 10; i= In 10: (Bas ste 


(12) 


Using (2) and (8), the variance of @ for fixed % , V(@ | xo), can be written 
a 
V(@ | a) = A De EAE Kea (13) 


The sum in (13) can be approximated by the following integral. 
The approximation will be good if d is small and the range of dose 
levels x; is large. 


d\n 2 
n 


ap? 
Viale) =f F@l - F@)] de = (14) 
For a given value of d, V(@ | x») will depend on the position of 2p . 


The maximum values of V(@ | 2) can be obtained for given d by dif- 
ferentiating (13): 


(VA | 20) 


=, = De Rene exp (— de"? **) [2 exp (— 6e"°-**) - 1). (15) 


Zeros for (15) can be obtained numerically for any given d, using a 
range of doses large enough so that terms ignored in (15) are negligible. 
The values of x thus obtained can be substituted in (13) to obtain the 
maximum and minimum variance. The results below are presented 
relative to the approximate variance (14): 
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Via | m) — SP? 
a= 4, Gh ss Min ol: din 2 < .0056, 
Z (16) 
Via | m0) — 22 
ai== 10: a= Ini0: din2 SAU ire 
n 


The numerical results (12) and (16) indicate that, if the range of 
doses is wide enough and the dilution factor a is 10 or less, then the 
following approximations can be used regardless of the position of a : 


d In 2 
n 


E,(f | Xo) = Bb, Va | Xo) = (17) 


Suppose that zx, is randomly chosen with uniform density over an 
interval of length d, say (A, A +d). Then the expectation of uw can 
be taken with respect to x, and the p; : 


E@) = E.IE@| 2] = 5  Ey@| 20) dx 
a lm +$ 4- Oe. d De ed | dx 


(Aad) = =e ie ee 


A+d 


(As ag) = i F(x) dx. (18) 


Integrating (18) by parts: 


A+d 


E(@) = i __ af(@) de. (19) 


If the doses extend over a wide enough range, (19) indicates that 
fi has a negligible bias: 
E(f) = p. (20) 


For x) randomly chosen on (A, A + d) the variance of @ can be written: 


V(A@) a Ea ma yu)” a E,,(V(a | Zo) a Bra | Xo) | 


A+d 


= 2! S meg — Fle) dte + Ff BGI 28) dee. 


V(a) = ae F(a)[1 — F(a) dx + q ae Bq |) %o) dat - (21) 
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If the doses extend over a range such that the first integral in (21) 
is approximated to the degree desired by integrating over the infinite 
range, V(a@) can be written 

d fo} 1 A+d 
Via) = = / Bo) — F(@)] de + 5 i Bp | x) dtp. (2) 
—o A 
The first integral has the value (d In 2)/n. It follows from (22) that 
d\n 2 
= max B°(a | a)]. (23) 


= | 
d in 2 SG singe 
nN 


| Vii 


From (23) and the numerical results in (12) the following bounds for 
the approximation of V(f@) by (d In 2)/n can by obtained: 


d\n2 
| VG i apes | ms 
aq=4, d= ln 4: Ties =5(35 760) LOE. 
n 
d\n 2 | 
| v@ — SR : 
C= NO, d= In 10: ane < (1.09n)10™°. 
n 


A summary of the results (20) and (24), for randomly chosen 2, , 
would be essentially the same as (17), i.e., if the doses extend over a 
wide enough range, 

d\n 2 
n 


EH) =n, Va (25) 

In concluding the remarks on the properties of f, it should be re- 
marked that the distribution of @, conditional on 2 , is asymptotically 
normal, since it is a linear function (3) of the sum of k + 1 binomially 
distributed random variables. 


4, ESTIMATION OF @ 


The point estimator of 6 based on @ was defined in (5). The expected 


value of this estimator can be approximated for random choice of 2, 
as follows: 


E(6) a= Biles zl lee aes ae 
=: OMice ae | Bo 7 a, (26) 


i=0 
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If the expectation is taken termwise in (26), terms in moments of order 
three and higher are dropped, and the approximation (25) is used, 
(26) can be written: 


E(6) = if a din 2). (27) 


From (27) an estimator that is less biased than 6 is 


vate 2n - 
— on-+dIn2° 2) 
The coefficient of variation of 6, C.V. (@), can be approximated 
using (5) and (24): 


: ER alUe ve VB Se (29) 


n 


The approximation to the variance of f, given in (17) and in (25), 
is identical with the asymptotic variance of the estimator for In @ 
proposed by Fisher [5]. Therefore the asymptotic efficiency of the two 
procedures for estimating @ will be the same, 88 percent. 

The interval estimator of @ given in (8) follows directly from (4), 
(17) and (25), and the asymptotic normality of 2. 


5. DESIGN OF THE SERIAL DILUTION ASSAY 


In choosing the range of dilutions, the conditions F(a.) > .99 and 
F(a, — kd) < .01 should be satisfied. This implies that the expected 
number of organisms per dose at the highest and lowest concentrations 
should be greater than 5 and less than .01 respectively. 

In designing a serial dilution assay the precision desired of the 
estimator must be specified. This can be done by specifying the desired 
value of the coefficient of variation (29). Then equation (29) can be 
used to obtain 7, the number of tubes per dilution, for any dilution 
constant a desired. 

Alternatively the precision of the estimator can be specified in 
terms of the interval estimator of 6. Using (8), the precision of the 
interval estimator for @ can be expressed in terms of the factor, R = 
e 8 Vln" 5 1 which will be multiplied and divided into 6 to obtain 
the upper and lower limits of the 95 percent confidence interval. Graph 2 
shows the values of R as functions of the dilution factor a for values 
of n from one to ten. This graph can be used to design dilution assays 
having desired precision. See the example in Section 6. 
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6. EXAMPLE OF DESIGN AND ANALYSIS 


Suppose it is known that the density of organisms @ is between 
10° and 10° organisms per unit volume. Suppose a ten-fold dilution 
factor is to be used. To assure doses spanning the range .01 to 5 
organisms per dose, it is necessary to use concentrations ranging from 
10~* to 10~’ dilutions of the original suspension, or 7 dilutions altogether. 

Suppose the value of @ is desired to within approximately four fold 
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TABLE I 
ILLUSTRATIVE Dara 


Proportion of Tubes 


Dilution Showing Growth p 
1071 3/83 
10 Bye 
10-3 2/3 
10-4 0/3 
10-5 0/3 
10-6 0/3 
10-7 0/3 
Lp = 8/3 
a6 20 
n=l 
fy 10 
n=2 
R 5 = en5lk ; 
n=3 
, ie = 4 
n=4 
3 hos : 
n=7 
n=8 
neon 
2 
| 
fr 2 4 6 g e 4 
ie 
GRAPH 2 
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in either direction (i.e., the desired value of R is approximately 4). 
From Graph 2, n = 3 tubes per dilution will give R = 4.2 for a dilution 
factor of 10. This means a total sample size of 7n = 21 tubes. In 
Table 1 some data and computational results are presented for such 
a design. 

Using Graph 1, 2.6 = 80 organisms per dose at the highest concen- 
tration, 10°*. Therefore, 6 = 800 organisms per unit volume of the 
original suspension. Without Graph 1, 6 would be computed as follows: 


i=in 10? pS Gn 10)(3) a7 2015. 


6 = e ’* = 824 organisms per unit volume of the original suspension. 
The interval estimate of 6 is obtained by using the factor R (= 4.2) 
read from Graph 2: 


824 er 
To <9 < 8244.2), 


196 < 6 < 3460 (confidence coefficient of .95). 


The point estimate 6 can be corrected for bias using equation (28): 
Ar 2n 


= niaesa eo 6 = (.79)(824) = 651 organisms per unit volume. 
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THE FITTING OF A GENERALIZATION OF 
THE LOGISTIC CURVE 


J. A, NELDER 
National Vegetable Research Station, Wellesbourne, England 


This paper describes the fitting of the four-parameter family of 
curves defined by the differential equation. 


aW - s ae 
aie = cw - (4) | (1) 


Special cases were originally proposed by Piitter [1920] for various 
types of animal growth (see e.g. von Bertalanffy [1957]), and recently 
Richards [1959] has exemplified the general form of the curves and 
suggested that they may be useful for the empirical description of 
plant growth. For further details of the history of these curves and 
their mathematical properties reference should be made to Richards’ 
paper. It suffices to say here that the family defined by (1) includes 
as special cases several curves which have been used empirically for 
the description of growth, including the ‘monomolecular’ (diminishing 
returns) curve (@ = —1), the exponential curve (6 — 0 through positive 
values), the logistic curve (@ = 1), and the Gompertz curve (@ — © 
with A fixed and « a linear function of 6). 


MODEL AND NOTATION 


In using (1) in the description of growth, W will usually denote a 
weight of some kind, while ¢ is usually chronological time, but may be 
a suitable ‘time scale’ (Skellam et al. [1959], Nelder et al. [1960]) which 
can replace chronological time when the environment is variable. Since 
it seems to be characteristic of many growth phenomena that the 
relative growth rate (W~' dW/dt) is almost constant when the weight 
is small compared to the final weight, we shall restrict 6 to be positive 
in what follows. For if 6 < 0, then, for W sufficiently small, W~* dW /dt 
becomes as large as we please in absolute value. When 6 > 0, the 
solution of (1) can be written in the form 


W = Aufl a DENS aa (2) 
89 
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where \ is the constant of integration. A and « are taken as positive 
since W is positive and is assumed to increase with time. Thus we 
satisfies the logistic equation with upper asymptote A’ ast > ~, 
while for ¢ large and negative 


Ww Acie 
We now write Y = In W and a = In A so that 
Y ae a a 6 In (1 ate hs (3) 


where In denotes the natural logarithm. (3) is the form of the equation 
that will be used for fitting, and we shall assume that if y = In w is 
a sample value of Y = In W at time t, then E(y) = Y, and var y = o” 
independently of Y. We shall also assume that the y; (¢ = 1, --- , ) 
obtained at time t; , i.e. the sample points to which the curve is to be 
fitted, are independent. The independence condition is satisfied for 
instance in plant growth analysis when a randomized field layout is 
used, and the sampling is destructive, i.e., a different set of plants is 
taken on each occasion. When the same set of plants is used on each 
occasion (when w might denote leaf area measured non-destructively) 
it is still possible to use the method proposed for estimation, though 
of course sampling variances cannot be obtained from residual mean 
squares ‘within curves’ if the condition of independence is not satisfied. 
This situation is further discussed later in the paper. 

The assumption that y = In w has constant variance has been found 
to be reasonable from an examination of data on the weight of part 
or the whole of several crops. In fact it is usually found that the 
distribution of y does not differ significantly from a normal distribution. 

For the purposes of this paper, (2) will be described as the generalized 
logistic equation, though it is not of course the only generalization that 
could or has been made from the logistic equation. 


FITTING THE LOGISTIC CURVE 


We consider first the case where @ is known. This is equivalent 
to fitting the logistic equation, since by putting W’ = W’”’, Y’ = Y/o, 
a’ = a/, ’ = d/6 and x’ = «/@ we can convert (2) into the logistic 
function. We may, therefore, put @ = 1 for this case without loss of 
generality. Let us write 
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then from (3) 


OY; OY; ar. 
Y; = In = {84 .= *=—€. ee iS 
a + d E.), 0a Ky On g; ? and OK E,t; d 
Thus the least-square equations are given by 
De Gi Y= 0, 
2 (y: — Y.)& = 0, (4) 


2s Yi Y)ét; = 0. 


These equations have no explicit solution in general, so that they must 
be solved by iteration. The usual method, using the expected values 
of the information matrix (see e.g. Bailey [1951]), gives the following 


TABLE I 
EXTENSION OF BerKson’s TABLE or AntI-Loaits [1/1 + e-7)] 


T 0 wp 2 3 a 5 6 it 8 9 
0.99 
5.0 331 337 344 350 = 357 363 369 376 382 388 
5.1 394 400 406 412 418 423 429 435 440 446 
5.2 451 457 462 467 473 478 483 488 493 498 
5.3 503 508 513 518 9523 527 532 537 541 546 
5.4 550 555 559 564 568 572 576 581 585 589 
5.5 593 597 601 605 609 613 617 620 624 628 
5.6 632 635 639 642 646 649 653 656 660 663 
5.7 667 670 673 676 680 683 686 689 692 695 
5.8 698 701 704 707 ~=—710 713 716 719 721 724 
ong 727 730 732 735 = 737 740 743 745 748 750 
6.0 753 755 758 760 762 765 767 769 7712 774 
6.1 776 778 781 783 = 785 787 789 791 793 795 
6.2 797 799 801 803 = 805 807 809 811 813 815 
6.3 817 819 820 822 824 826 827 829 831 832 
6.4 834 836 837 839 841 842 844 845 847 848 
6.5 850 851 853 854 = 856 857 859 860 861 863 
6..6 864 865 867 868 869 871 872 873 875 876 
6.7 877 878 879 881 882 883 884 885 887 888 
6.8 889 890 891 892 893 894 895 896 897 898 
(8 899 900 901 902 §=9038 904. 905 906 907 908 


To obtain £, from this table and Berkson’s table in Berkson [1953], use the formula, — = antilogit 
(—T) = 1 — antilogit (7). 


For T positive —In (1 — €) = tabulated value. 


TABLE II 


VALUES OF 104 In (1 + e7) 


For 7 negative —In (1 — €) = |r| + tabulated value for |r]. 
iF 0) 1 2 3 4 5 6 a 8 9 
0.0 6931 6882 6832 6783 6733 6685 6636 6588 6539 6492 
0.1 6444 6397 6349 6303 6256 6210 6163 6118 6072 6027 
0.2 o98t 5936 5892 5847 5803 5759 £5716 5672 5629 5586 
0.3 5544 =5501 5459 5417 5875 5834 5293 5252 5211 5170 
0.4 5130 5090 5050 5011 4972 4932 4894 4855 4817 4779 
0.5 4741 4703 4666 4629 4592 4555 4518 4482 4446 4410 
0.6 4375 4340 43804 4270 4235 4201 4166 4132 4099 4065 
0.7 4032 3999 3966 39383 3901 3869 3837 3805 8773 3742 
0.8 3711 3680 3649 3619 3589 3559 3529 3499 3470 3441 
029 3412 33838 3354 3326 . 3298 3270 3242 3214 3187 3160 
10 3133 3106 3079 3053 3027 3001 2975 2949 2924 2898 
Ih als 2873 2848 2824 2799 2775 2751 2727 2703 2679 2656 
VP 2633 2610 2587 2564 2542 2519 2497 2475 2453 2432 
Ih 503) 2410 2389 2368 2347 2326 2305 2285 2264 2244 29294 
1.4 2204 2184 2165 2146 2126 2107 2088 2070 2051 2032 
1.5 2014 1996 1978 1960 1942 1925 1907 1890 1873 1856 
ho 1839 1822 1806 1789 £1773 1757 174 1725 1709 1693 
iL na 1678 1662 1647 1632 1617 1602 1588 1573 1558 1544 
1.8 1530 1516 1502 1488 1474 1460 1447 1433 1420 1407 
1.9 1394 1381 1368 1855 1343 1330 1318 13806 1293 1281 
2.0 1260257 24 Ge 234 D2 ee Oa 1200 1188 1177 1166 
Pe | 1155 1144 1134 1123 1112 1102 1091 1081 1071 1061 
22 1051 =1041 1031 1021 1012 1002 993 983 974 965 
2.3 955 946 937 928 920 911 902 894 885 877 
2.4 868 860 852 844 836 828 820 812 804 796 
275 789 781 774 766 759 (o2 745 HOG 730 723 
2.6 716 710 703 696 689 683 676 670 663 657 
Diet 650 644 638 632 626 620 614 608 602 596 
2.8 590 585 579 573 568 562 557 551 546 541 
2.9 536 530 525 520 515 510 505 500 495 491 
3.0 486 481 476 472 467 463 458 454 449 445 
Biol 441 436 432 428 424 420 415 4i1 407 403 
one 399 396 392 388 384 380 Ou 373 369 366 
Bots 362 359 355 352 348 345 341 338 335 331 
3.4 328 325 322 319 316 312 309 306 303 300 
ono 297 295 292 289 286 283 280 278 275 272 
3.6 270 267 264 262 259 257 254 252 249 247 
Ba 76 244 242 239 237 250 232 230 228 226 223 
Be 221 219 217 215 213 211 208 206 204 202 
Sy 8) 200 198 196 195 193 191 189 187 185 183 
4.0 181 180 178 176 174 173 ‘wal 169 168 166 
Ara 164 163 161 160 158 156 155 153 152 150 
Ae 149 147 146 144 143 142 140 139 Iesye 136 
4.3 135 133 132 Tesi 130 128 127 126 124 123 
4.4 122 121 120 118 117 116 iN) 114 i183 112 
4.5 110 109 108 107 106 105 104 103 102 101 
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TABLE Il—(Continued) 


LP Ui We an 
4.60 100 5.10 61 5.60 37 6.10 22 6.60 14 
4.70 91 5.20 55 5.70 33 6.20 20 6.70 12 
4.80 82 5.30 50 5.80 30 6.30 18 6.80 11 
4.90 74 5.40 45 5.90 27 6.40 17 CaO 
5.00 67 5.50 41 6.00 25 6.50 15 = == 


adjustments dap , 6Ao , dk to initial ‘guesses’ a , Ao , Ko for the param- 
eters=— 


in Ds Ei yy E:t; || devo 2 Ue = 9G) 
| > E; > Et; dX | = > E(y; le) (5) 
L ps El; Ko ys Eby; = Y,) 


where £; is evaluated with X = \, and x = x. New values of £; and Y, 
are then recalculated using a, = a + da, etc. and further adjustments 
obtained, the process being repeated until whatever accuracy required is 
achieved. Note that the information matrix depends on the parameters 
only through 7; ; the process of solution of (5) would obviously be con- 
siderably speeded up if tables of €; and In (1 — £€,;) as functions of 7; 
were available. 

Now é = antilogit (—7) = 1 — antilogit (7) in Berkson’s [1953] 
notation; hence € can be obtained for the range —4.99 < 7 < 4.99 
from the table of antilogits given in that paper. An extension of this 
table for 5 < | 7 | < 6.99 which covers the region down to W/A = 0.001 
and up to W/A = 0.999 is given in Table I. Values of In (1 + e7’) 
to four decimal places are given in Table II for —6.99(0.01)6.99 from 
which In (1 — &) can be quickly derived. These tables have been 
computed from the tables of e” and In x given in Comrie [1949] and 
should prove sufficiently accurate for most purposes. If linear inter- 
polation is used from the nearest tabulated value, using the tabulated 
first difference, rounding-off errors will rarely exceed 1 unit in the 
last place. 

Because Y; is linear in a, knowledge of its approximate value is 
not necessary for the iterative process using the expected information 
matrix, and we can replace day in (5) by a, , and Y by In (1 — £), solving 
directly for a; , 59 and 6x9 . However when we come to consider the 
exact equations for the iteration, using the empirical information matrix 
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instead of the expected one, we shall find no advantage in suppressing 
a , so that it will be retained in what follows. The calculations can 
be conveniently laid out as follows: 


TABLE III 
ComputinG Layout For FitTinc THE LOGISTIC CURVE 


The ¢; are entered in column 1, whence the 7; are calculated using the 
starting values of \ and x, and entered in column 2. Column 3 is filled 
up from Table I, and then column 4 calculated. Column 5 is filled up 
from Table II, and the y’s entered in column 6. a is estimated by 
yj — >> n (1 — &)/n from the sums of columns 5 and 6, and hence 
Yi — Y; = ys — a — In (1 — &) is calculated. The coefficients on 
the left-hand side of (5) can now be calculated from columns 3 and 4, 
by summing the columns and from sums of squares and products of 
f; and £,t; . Finally the right-hand side of (4) may be calculated 
from >) (y; — Y;), which should be zero, apart from rounding errors, 
and from products of columns 3 and 4 with column 7. The equations (4) 
may now be solved using any of the standard techniques (see e.g., 
Dwyer [1951]). 


The exact iterative equations 


If we now denote the parameters to be fitted by 6; , then the above 
method uses the expected information matrix whose general term is 


sr ov oY, 
~ 00, 006, 


The exact equations for the generalized Newton-Raphson process are 
those using the empirical information matrix with general term 


Ono omy 
> FY ac ee LY. — Fe) 568," 


Cornfield and Mantel [1950] for the analogous situation in probit 
analysis found a substantial improvement in the speed of convergence 
using this latter form, so that it may be worth considering the ap- 
propriate modification here. This necessitates adding to the left-hand- 
side of (5) the matrix 
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0) 0 0 
2 (y; — Y.)&{1 — &) SS (y; — Y.)&(1— &,)t; 
E y (yi — Ye — Et: 


and two more columns need to be added to Table III, namely 
(y — Y)(1 — &) and £¢’ in order to compute the elements of this matrix. 
The extra calculation is not large but the increase in speed of convergence 
achieved may justify the extra labour only when a’ is large so that the 
(y; — Y;) are appreciable. 

The dispersion matrix of the estimates of the parameters may be 
obtained in the usual way by inverting the information matrix on the 
left-hand side of (5) and then multiplying it by an estimate of o’. 
For methods of matrix inversion, and the combination of the inversion 
with the solution of an associated set of equations, see e.g., Dwyer [1951]. 

If a variance for A rather than @ is wanted, then we may use the 
approximate formula 


A 


var A = vare* ~e* varé@é = A’ varé 


provided that var A is small compared to A’. 


Obtaining the starting values 


The starting values may be obtained in various ways, some wholly 
graphical and some partly so. Some compromise must be made between 
the desirability of having them as accurate as possible so that iterations 
are reduced to a minimum, and the undesirability of spending an 
excessive amount of time on a preliminary stage of the computations. 

A semi-graphical method which I have found useful is as follows. 
First plot the logistic variable w’’’, henceforth denoted by x, against t, 
and estimate by eye the value of A; call this estimate A’. Then plot 
In [x/(A’ — 2x)] against ¢t. If the logistic function is the correct one 
and A’ is also correct, this should give, apart from random errors, a 
straight line. Now the variance of In [x/(A — 2)] using the first-order 
approximation, is given by [A/(A — «)]’o° = £o” in the notation 
already established. This variance increases as x increases, tending 
to infinity as x tends to A. Thus when the variance of x is constant, 
the graph of In [z/(A’ — )] against ¢ will show increasing scatter as 
‘g approaches A. In addition, if x has a symmetrical distribution, that 
of In [x/(A — 2x)] will be positively skewed for x near A and positive 
deviations will be in mean value much greater than negative ones. 
If A’ is too high, then the graph of In [x/(A’ — )] against ¢ will show 
a tendency to level off at In [A/(A’ — A)], while if A’ is too low, the 
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graph will show an increasing slope as x approaches A. If the graph 
shows a sigmoid shape, the logistic equation is inappropriate. Assuming 
that no sigmoid shape has been found, we may obtain an improved 
estimate of A as follows. Divide the points into two sets, with the 
dividing line roughly where « = A’/2; then points with « < A’/2 are 
relatively slightly affected by errors in A, and have the smallest vari- 
ances. Fit by eye a straight line to the points on the (In [z/(A’ — 2)], t)- 
graph using the points where x < A’/2 and giving most weight to the 
smaller values of x. Read off the fitted values of z/(A — 2) on this 
line corresponding to the values of ¢ for points having x > A’/2. If 
z is such a fitted value, then, ignoring errors, 2 = «/(A — x) and so 
A = x(1 + z)/z. Hence if we write u = x(1 + 2)/z, then wu is an esti- 
mate of A. To obtain an improved value for A, calculate u for each 
point having « > A’/2 and use @, the simple mean of the u’s. The 
method is quite rapid in practice if a logarithmic graph paper is used, 
since then In [x/(A’ — 2)] can be plotted directly from a knowledge 
of x/(A’ — x) and z can also be read off directly from the fitted line. 
If the improved value of A still gives curvature, the process may be 
repeated. Having obtained a good value for A, , the starting value 
for A, we may now calculate ») and x either by a graphical fitting 
of a straight line to the graph of In [x/(A, — x)] against ¢, using the 
relation In [X/(Ao — X)] = X + xt or by a weighted regression of 
In [z/(Ay — 2)] on ¢ using as weights = [(A, — x)/A,}’. When 


TABLE IV 


CALCULATIONS TO OBTAIN STARTING VALUES FOR Fitting 
THE Logistic CurvE 


t ae xz/(71 — x) u = «(1 + z)/z x/(73.2 — x) 
—2.15 Rall 0.0529 0.0513 
—1.50 6.25 0.0965 0.0934 
—0.85 9.54 0.155 0.150 
—0.08 16.91 0.313 j 0.300 
+0.52 24.51 0.510 0.503 

1.10 33.78 0.908 0.857 

2.28 50.00 2.38 719 216 

BR 62.05 6.93 74.0 5.57 

4.00 69.34 41.8 HOD 18.0 

4.65 67.09 NSA ile 11.0 

5.00 69.34 41.8 72.4 18.0 

Mean 73.2 
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o is small, the fit by eye should be quite adequate, but the weighted 
regression may have some advantage when the scatter about the line 
is considerable. If logarithmic paper is used, it must be remembered 
that one cycle corresponds to log, 10 = 2.30259 units when the slope 
is to be measured. 


Worked example 


The data in Table IV relate to the growth of carrot tops in a field 
experiment, and were obtained by my colleague Mr. R. B. Austin. 
6 has been taken as known and equal to 2, so that 2, the logistic variable, 
equals W’’’; tis a time scale based on total incoming radiation (negative 
values occur because an origin had been taken with a zero between 


o-ol 


=? =] ° I 2 3 4 5 t fore and x 


FIGURE 1 
‘MEruop oF OBTAINING STARTING VALUES FOR a, i, kK, FROM DATA OF TABLE IV. 


@ Values of z/(71 — z) plotted on logarithmic scale against t. : ; 
X Fitted values of Z = (4/A — 2) obtained from line drawn by eye through points having 
zx < 35.5. 
O Values of x/(73.2 — x) plotted on logarithmic scale against t. 
[Note: t— scale displaced to avoid overlapping] 
-—- Line fitted by eye through points O. 
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the fourth and fifth reading; the reasons for this have no relevance for 
this example). ‘Table IV also contains the preliminary calculations 
necessary to arrive at Ay , the starting value for A. An initial plot 
of x against t gave an A’, as judged by eye, of 71. 2x/(71 — x) was 
then calculated and recorded in the third column of Table IV. Figure 1 
shows the graph of In [x/(71 — x)] against ¢. From the solid line fitted 
by eye to the points having x < A’/2 = 35.5, the fitted values of 
2/(A — 2x) for the last 5 points were read off, with w calculated for 
each, and recorded in the fourth column of Table IV. The revised 
value of A’ is @ = 73.2. Values of x/(73.2 — x) were then calculated 
(fifth column of Table IV) and plotted against ¢ (Figure 1); the broken 
line was fitted by eye, giving most weight to the small values of z, 
and gave \) = —1.109, k = 0.861. The calculations for the exact 
iterative method discussed above can now be started. Filling up 
Table III we have the following: 


t; T% &5 Ext; nC = "s;) Yi Ne = Vs 
—2.15 —2.960 0.9507 —2.0440 —3.0105 e272 —0.0020 
—1.50 —2.400 0.9168 —1.3752 —2.4868 1.832 +0 .0343 
—0.85 —1.841 0.8631 —0.7336 —1.9883 2.255 —0.0412 
—0.08 —1.178 0.7646 —0.0612 —1.4464 2.828 —0.0101 
+0.52 —0.661 0.6595 +0.3429 —1.0773 3.199 —0.0082 
+1.10 —0.162 0.5404 0.5944 Series 3.520 +0.0129 
+2.28 +0.854 0.2986 0.6808 {Na 7/ 3.912 —0.0178 
+3.23 +1.672 0.1582 0.5110 =()), eee 4.128 +0.0157 
+4.00 +2 .335 0.0883 0.3532 —0.0924 4,239 +0 .0469 
+4.65 +2.895 0.0524 0.2437 —0.0538 4.206 —0.0247 
+5 .00 +3.196 0.0393 0.1965 —0.0401 4.239 —0.0054 


7 = 3.2391, (1 — &:)/n = —1.0454, ao = 4.2845, S.S. = 0.00661578. 


Adjustment equations are 


| 11.0000 5.3319 —1.2915 || 6a, 0.0004 | 
3.9272 —3.0008 || 6\./ = | —0.0124 |- 
8.0293 || dx, —0.0020 | 
Solution is given by 
dao = 0.010025, a, = 4.2940, 
do = —0.021566, te 1 = —1.1300, 
ko = —0.006746, iy = 0.8544, 
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If a further iteration is required then a, should be estimated in 
the same way as a , ie. using 7 — >> In (1 — é;)/n, and the value 
obtained above ignored. However the adjustments obtained from the 
first cycle are sufficiently small to justify our stopping in this case. 

The goodness of fit of the logistic curve can now be checked by 
comparing the residual mean square after fitting with the residual error 
obtained from replication. Using the estimates a, , 1 , 1 , we have 
a residual sum of squares after fitting of 0.00642 giving a mean square 
with 11 — 3 = 8 df. of 0.00080. The error mean square was 0.0033 
so that the fit is satisfactory. We may now obtain the dispersion 
matrix by inverting the information matrix and multiplying it by the 
variance of a single y; , as estimated by the error mean square. Thus 
the dispersion matrix is given by 


| 11.0000 5.3319 —1.2915 |"? 0.00137 —0.00237 0.00066 
0.0033 X 3.9272 —3.0008; = 0.00526 0.00159 |. 
8.0293 0.00090 


The standard errors of the estimates affect the second place of 
decimals, thus justifying the stopping of the iterative process after 
one cycle. 


FITTING THE GENERALIZED LOGISTIC CURVE 
If we now define 7; as (A + xt;)/6, then from (2) 
Y=a+t @ln(1 — &) 
where é = e “/(1 + e ’) as before, and 


2 eae OV cen Oh Gre Ny ORE Le 
as E; ) OK = Ejt; ) 00 — In (1 E;) Tigi B; ] say. 


The least-square equations, therefore, are given by 
G-Yy=0, Ly — Yost = 0, 
oe (yi — Y,)& = 0, Ds (y; — Y,)8; = 0 


and the iterative solution from starting values a , Ao , ko , % , gives 
adjustments 6a, etc. from the solution of 


m De Dk DB: TY] ba “ys - Yd) 
Dee Leet DBs ff Mo] _ | Ds — KH |, oy 
DEE DS Bets I! 8Ko yi — Vode 
>» B; 68 > (Ve — ae; 


(6) 


~ 
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TABLE V 
VaLurs or —1048, — B = r/(1 + e-7) — In(1 + e7) 


T 0 1 2 3 4 5 6 a 8 9 
0.0} 6931 6931 6931 . 6930 6929 6928 6927 6925 6923 6921 
0.1} 6919 6916 6913 6910 6907 6903 6900 6895 6891 6887 
0.2} 6882 6877 6871 6866 6860 6854 6848 6841 6834 6827 
0.3 | 6820 6813 6805 6797 6789 6781 6772 6763 6754 6745 
0.4 | 6735 6726 6716 6706 6695 6685 6674 6663 6652 6640 
0.5| 6628 6617 6605 6592 6580 6567 6554 6541 6528 6515 
0.6 | 6501 6487 6473 6459 6445 6430 6415 6400 6385 6370 
0.7} 6355 6339 6323 6307 6291 6275 6258 6242 6225 6208 
0.8; 6191 6174 6157 6139 6122 6104 6086 6068 6050 6031 
0.9} 6013 5994 5976 5957 5938 5919 5900 5881 5861 5842 
1.0} 5822 5802 5782 5763 5743 5722 5702 5682 5662 5641 
tho 5620 5600 5579 5558 5537 5516 5495 5474 5453 54382 
1.2} 5411 5389 5368 5346 5325 5303 5281 5260 5238 5216 
1.3} 5194 5172 5150 5128 5106 5084 5062 5040 5018 4996 
1.4| 4974 4951 4929 4907 4885 4862 4840 4818 4795 4773 
1.5 | 4751 4728 4706 4683 4661 4639 4616 4594 4571 4549 
1.6] 4527 4504 4482 4460 4437 4415 4393 4370 4348 4326 
1.7] 4804 4282 4259 4237 4215 4193 4171 4149 4127 4105 
1.8] 4083 4061 4039 4018 3996 3974 3952 3931 3909 3887 
1.9|] 3866 3844 3823 3802 3780 3759 3738 3717 3695 3674 
2.0} 3653 3632 3611 3591 3570 3549 3528 3508 3487 3467 
2.1| 3446 3426 3406 3385 3365 3345 3325 3305 3285 3265 
2.2} 3245 3226 3206 3186 3167 3147 3128 3109 3090 3070 
2.3} 3051 3032 3013 2994 2976 2957 2938 2920 2901 2883 
2.4) 2864 2846 2828 2810 2792 2774 2756 2738 2721 2703 
2.5 | 2685 2668 2650 2633 2616 2599 2582 2565 2548 2531 
2.6 | 2514 2497 2481 2464 2448 2431 2415 2399 2383 2367 
2.7 | 2351 2335 2319 2303 2288 2272 2257 2241 22996 92211 
2.8] 2195 2180 2165 2150 2136 2121 2106 2091 2077 2062 
2.9} 2048 2034 2020 2005 1991 1977 1964 1950 1936 1922 
3.0} 1909 1895 1882 1868 1855 1842 1829 1816 1803 1790 
3.1] 1777 1764 1752 1739 1726 1714 1702 1689 1677 1665 
3.2] 1653 1641 1629 1617 1605 1594 1582 1570 1559 1547 
3.3] 1586 1525 1514 1502 1491 1480 1469 1459 1448 1437 
3.4] 1426 1416 1405 1395 1884 1874 1364 1354 1344 1333 
3.5] 1323 1314 1304 1294 1284 1274 1265 1255 1246 1236 
3.6] 1227 1218 1208 1199 1190 1181 1172 1163 1154 1146 
3.7) L137> 1128 1120 1111 2102 1094" “1086 = 107771060 106 
3.8} 1053 1045 1037 1029 1021 1013 1005 997 989 982 
3.9 974 967 959 952 944 937 930 922 915 908 
4.0 901 894 887 880 873 866 859 853 346 839 
4.1 833 826 820 $13 807 800 794 788 782 775 
4.2 769 763 757 751 745 739 733 728 722 716 
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TABLE V—(Continued) 


T 0 1 2 3 4 5 6 a 8 9 
4.3 710 705 699 694 688 682 677 672 666 661 
4.4 656 650 645 640 635 630 625 620 615 610 
4.5 605 600 095 590 586 581 576 572 567 562 
4.6 558 553 549 544 540 536 531 527 523 518 
4.7 514 510 506 502 498 494 490 486 482 478 
4.8 474 470 466 462 458 455 451 447 444 440 
49 436 433 429 426 422 419 415 412 408 405 
5.0 402 398 395 392 389 386 382 379 376 BY is) 
5.1 370 367 364 361 358 359 352 349 346 343 
5.2 340 337 330 332 329 326 324 321 318 316 
5.3 313 310 308 305 303 300 298 295 293 290 
5.4 288 285 283 281 278 276 274 271 269 267 
5.5 265 262 260 258 256 254 252 249 247 245 
5.6 243 241 239 237 235 233 231 229 227 225 
5.7 223 222 220 218 216 214 212 211 209 207 
5.8 205 204 202 200 198 197 195 193 192 190 
5.9 188 187 185 184 182 181 179 178 176 175 
6.0 173 172 170 169 167 166 164 163 162 160 
6.1 159 158 156 155 154 152 151 150 148 147 
6.2 146 145 143 142 141 140 138 137 136 135 
6.3 134 133 132 130 129 128 127 126 125 124 
6.4 123 122 121 120 119 118 116 116 114 114 
6.5 113 112 aes 110 109 108 107 106 105 104 
6.6 103 102 101 101 100 99 98 97 96 95 
6.7 95 94 93 92 91 hit 90 89 88 88 
6.8 87 86 85 85 84 83 82 82 81 80 
6.9 80 79 78 WO, U0 76 75 75 74 73 
7.0 73 72 72 ffl 70 70 69 69 68 67 


Little amendment is needed to Table III for the solution of these 
equations. A column for 8 must be inserted between columns 4 and 5, 
and 8 can be obtained rapidly from Table V. ap is estimated as ¥ — 
G: >. In (1 — &:)/n, and y — Y; = y: — a — % In (1 — &). All 
the elements in (7) can then be calculated. 


Obtaining the starting values 


A sign that @ is not equal to unity is the appearance of a trend 
in the values of u during the graphical process for finding Ay . If 
§ > 1, then values of w show a decreasing trend as w departs from A, 
and the lowest of them are often below the highest values of w. Con- 
versely if 9 < 1, then u shows an increasing trend as w falls. According 
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to the trend shown by the wu’s, 6 can be changed in the appropriate 
direction, and when @, has been obtained, with the corresponding A”, 
we can use the relation 


a? 1 
pn iy + kh)=7 
to get estimate of Ao/4> and xo/% and hence of X) and x . For the 
calculations of coefficients in (7), however, only t and 7 are needed. 


Worked example 


In the example above of fitting the logistic curve to W*””, we were 
in fact fitting the generalized logistic curve taking @ as known and 
equal to 2. We now consider the fitting when @ is not assumed known, 
using the value 2 as a starting point. Columns 1-5 of Table III are 
unchanged but y in column 6 is now twice the previous value since we 
were working before with In z”” = 4Inz. Column 7 will also be doubled 
and a column of 8; must be added. The revised and added columns 
are given together with the iterative equations in Table VI. 


TABLE VI 


REVISED AND ADDITIONAL COLUMNS FOR FITTING THE GENERALIZED 
Loaistic CURVE 


Bi Yi We = Ve 
—0.1964 2.544 —0.00388 
—0.2865 3.664 +0.0688 
—0.3992 4.510 —0.0824 
—0.5457 5.656 —0.0200 
—0.6414 6.398 —0.0162 
—0.6895 7.040 +0.0252 
—0.6096 7.824 +0.0356 
—0.4866 8.256 +0.0314 
—0.2987 8.478 +0.0942 
—0.2054 8.412 —0.0494 
—0.1657 8.478 —0.0106 


Adjustment equations are 


11.0000 5.3319 —1.2915 —4.4747]] 8a) ] 0.0016 
3.9273 —3.0008 —2.3015 || 6. b__ | —0.0244 
. 8.0293 —0.3343 II 8x0 —0.0048 


2.1857 JL66o 0.0096 
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The solution of these equations is given by 
day = —0.016240, 


d\) = —0.201431, 
5k = —0.089091, 
50) = —0.254585, 
whence 
x 
1 = o,(%2) + 6r. = 2(—1.109) — 0.2014 = —2.4194, 
0 


kK = o,( #2) + dx, = 2(0.861) — 0.0891 = +1.6329, 
0. 


> 
l| 


2 — 0.2546 = 1.7454. 


a, is not given because it is better estimated from 7 — 0, >. In (1 — &;)/n 
in the next iteration, as described above. 

The rather large and approximately equal adjustments to A» and 6 
should be noted; the estimates of these parameters are in fact highly 
correlated and have relatively large variances; hence the reduction of 
the S. S. of residuals (from 0.0265 to 0.0239) following the adjustment 
is rather small. 

A second iteration using the values a, , \, , 1 , 0, gives 


da, = —0.00446770, Oe = 8.5538, 
6\, = +0.02565976, Pe. Ae = 2.3987, 
6x, = +0.00782844, ko = 1.6407, 
66, = +0.02010622, 6, = 1.7655. 
and a third and final iteration gives 

a3 = 8.5539, 

As = 2.0917, 

eA 

6, = 1.7676 


with dispersion matrix 
0.01419 0.02379 0.01526 0.06075 


0.138220 0.06751 0.21618 || 
0.03705 0.11775 
0.40847 
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In comparing the estimates and their variances and covariances with 
those obtained for @ assumed known and equal to 2, it should be remem- 
bered that the effective parameters estimated in the latter case were 
a/3 = a/2 etc. Thus parameter values for the logistic case should 
be multiplied by 2 and their variances and covariances by 4 for com- 
parison with those of the four-parameter curve. As was to be expected, 
the variances of &, 4, and & have been considerably increased by the 
inclusion of 6, also 6 itself is poorly determined with a standard error 
of +0.639, so that it does not differ significantly from the value of 2 
originally used. 


ALLOCATION OF SAMPLE POINTS 


This section is concerned with the effect on the efficiency of esti- 
mation of different allocations of position of sample points on the curve. 
Obviously points on the curve where x’ is small compared to A‘”’ 
give almost no information about A, while points where x’ is close 
to A’ give almost no information about \ and x, the parameters 
defining the exponential part of growth. To reduce the range of pos- 
sibilities of the distribution of the sample points to manageable pro- 
portions, attention will be confined to samples at points equally spaced 
with regard to ¢ and symmetrically arranged about r = 0. [In practice, 
a rough idea of the values of the parameters will enable observations 
to be taken at times similar to those considered here]. The points of 
the arrangements considered are spaced along the ¢ scale in units of r 
or 37, while the ranges spanned are 67, 107, and 147. Table VII gives 
the relative sampling variances per point for the parameters, and also 
the generalized information per point using | Q |/n where n is the 
number of points and Q the information matrix for the samples. Two 
cases are considered: @ known (3 parameters to be estimated) and 
6 unknown (4 parameters to be estimated). 

The table shows that more intensive sampling of a given range 
[e.g. —3(4)3 instead of —3(1)3] slightly increases the sampling variance 
per point while increasing the information per point. The increased 
information implies that some or all of the correlations between the 
estimates have been reduced by the increased density of points, although 
the arrangement 3(4)3 is less efficient than 3(1)3 when measured in 
terms of sampling variances only. Increasing the range reduces the 
sampling variances per point as one might expect, and so yields more 
information than more intensive sampling of the same range. The 
sampling variances of @, h, and ¢& are considerably increased when 
@ is unknown, particularly when the range of 7 is small. This is of 
course a reflection of the fact that curves for neighbouring values of 
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TABLE VII 


SAMPLING VARIANCES Per Pornt or THE PARAMETER ESTIMATES 
AND THE GENERALIZED INFORMATION PER Point 
FOR VARIOUS DISTRIBUTIONS OF THE OBSERVATIONS 


6 Known, a, \, x UNKNOWN, o? = 1 


Sampling variance 
per point Generalized 

No. of Allocation . information per 
points of + a N R point 
7 —3(1)3 5.39 39.2 3.60 ow 
ll =H 3.11 23.1 1.14 16.93 
15 TK OU 2.59 18.5 0.56 73.16 
13 —3(3)3 6.05 42.9 4.21 4.32 
21 —5(3)5 3.22 23.8 1.25 54.80 
29 —7(3)7 2.63 18.8 0.60 252.6 


a, \, k, 6 UNKNOWN, o? = 1 


Sampling variance 
per point Generalized 
No. of Allocation information per 
points of + a IN z 6 point 
7 —3(1)3 14.26 361.2 41.50 339.3 0.0324 

11 —5(1)5 4.99 84.0 5.28 89.4 2.008 

15 —T(1)7 3.50 53.4 1.62 54.4 20.16 

13 —3(2)3 17.20 420.8 51.57 398.4 0.2655 

21 —5(3)5 5.04 91.5 6.138 96.3 11.95 

29 —7(4)7 3.60 45.5 1.82 56.2 130.4 


6 can be made to agree very closely over this range of 7 by suitable 
changes in the other parameters. { and 6 have the largest sampling 
variances in all the arrangements and hence are the most difficult to 
estimate accurately. For instance, if an individual value of w, in an 
experiment had a percentage standard error of 5, then var y = var 
(In w) ~ 0.0025, so that with the allocation —38(1)3 we should have 
var 6 = 0.0025 X 48.47, whence s.e. 6 = +0.35. Since 6 is often in 
“the range 1-2, the allocation sr = —3(1)3 would be inadequate to 
determine this parameter at all accurately. Similar considerations 
apply to}. By such arguments one can find what range w must cover 
if adequate estimates of the parameters are to be obtained for 


Wee Ase e*)’ 
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so that 7 = In [W'’"/(A”’ — W'”’)] = logit (W/A)'”. Thus if 6 is, 
say, equal to 2 then r = 8 corresponds to (W/A)'”’ = 0.95257 whence 
W/A = 0.976; similarly r = —3 gives W/A = 0.024. Thus when 
6 = 2 the range r = —3 to 3 implies that W ranges from 2.4 to 97.6 
percent of A, and the argument above shows that this range will often 
be insufficient for adequate estimates of some of the parameters. Note 
that the larger the value of @ the greater the range of W/A required 
to cover the same range of r. 


DISCUSSION 


This paper is not the place for a discussion of the place of mathe- 
matical functions in the description and analysis of growth. A great 
deal has been written on the subject both for and against the utility 
of this approach, and it suffices to say that I assume in this paper that 
differential equations and their associated integrals have some use- 
fulness, and that methods are required for the efficient estimation of 
the parameters. For an approach to the problem of interpreting 
growth curves which involves replacing chronological time ¢ by some 
function of meteorological observations see Nelder ef al. [1960]. [In 
the example above t is in fact a scale based on total incoming radiation]. 

The method of estimation used in this paper is based on three 
assumptions: (i) that the w; are independent, (ii) that E(y;) = Ve 
and (iii) that var y; is independent of Y;. It may reasonably be asked 
how suitable the procedure is likely to be if some or all of the assumptions 
are not true, and this involves us in an analysis of the errors in the NG ve 
Errors may be due to variation in the parameters from one individual 
to another, for either genetic or non-genetic reasons; or they may arise 
from errors in ¢; , caused by using the wrong time-scale. In addition 
we have the possibility of systematic errors arising from deviations 
from the form of the curve actually used in the fitting. It will certainly 
not be exactly true that if, for instance, x has a normal distribution in 
the population of individuals whose growth is being studied, then random 
sampling will enable us to set E(y,) = Y; , where Y, is given by (3) 
with the mean of « used in the formula. But such biases will not be 
serious if the variance of « is small, i.e. if we control the genetic and 
environmental uniformity of the population well. If the w; are from 
the same experimental units throughout, which is the commonest 
cause of non-independence, the method will remain satisfactory if 
var (y;) remains constant, where the variance is now measured within 
individual growth curves. However sampling variances of the esti- 
mates of parameters must now be calculated from the variation in the 
parameter values between individuals. If the variation between indi- 
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vidual curves is much greater than the variation within them, then the 
efficiency of the fitting method becomes correspondingly less important, 
and even graphical methods might suffice. A frequent cause of devia- 
tions of var (y,;) from constancy is the occurrence of constant-variance 
weighing errors on the w scale. On the log. scale these become more 
serious as the weight falls, and lead to var (y;) increasing rapidly at 
very low values of w, where the weights are not large compared to the 
weighing errors. Difficulties in measuring very small distances on a 
plant cause the same effect. In practice the change in var (y) can often 
be approximately eliminated by a suitable increase in replication at 
very low levels of w. 

The model employed here for fitting the logistic curve may be con- 
trasted with a method suggested by Stevens [1951], who proposed 
writing it in the form 


and using his method for fitting curves of the type 


K 


z=a+ 8p, with z= end Spr es". 


i 
W’ 
Since he gave equal weight to his 2’s in the fitting, this is equivalent to 
assuming that 1/w has constant variance and hence that var (In w) « W” 
to the first order. Though conditions may exist where var (In w) « W’, 
in my experience it is usually much closer to the truth to take var (In w) 
constant. Several papers concerned with fitting the curve z = a + Bp' 
have appeared (e.g. Nair [1954], Patterson [1956], Finney [1958], Patter- 
son and Lipton [1959]), but these are almost entirely concerned with 
the case where z has constant variance and where the ¢; are equally 
spaced. Where time scales based on meteorological measurements (such 
as day-degrees) are used for field crops, it is usually impossible or 
impracticable to arrange for equally spaced ¢’s, so that methods used 
in the above mentioned papers are no longer available, even if it could 
be assumed that var z was constant. In addition, none of the methods 
is immediately applicable to the general case where 6 has to be estimated. 
| The reasons behind the particular choice of the parameters in (3) 
should perhaps be mentioned. There are, of course, an infinite number 
of equivalent ways of writing (3), in so far as the structural parameters 
are concerned. The particular form used has been chosen to reduce 
the computing labour of the iterative solution as much as possible, 
while retaining a set of parameters with fairly clear meanings as far 
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as growth is concerned. Thus a = In (asymptotic value of W), while 
and «x define the exponential growth stage for large negative ¢ when 
W ~ Ae**". A possible interpretation of @ for a field crop in terms 
of the spatial distribution of plant material has been given by Nelder 
et al. [1960]. The form used may be capable of further improvement, 
especially from the point of view of the convergence of the iterative 
process. It is an interesting point to what extent approximately 
orthogonal parameters can be found for the general non-linear function, 
that is parameters such that the cross terms in the dispersion matrix 
are small compared to the diagonal terms, and whether the use of such 
parameters would speed the convergence of the iterative process. While 
a linear transformation of the parameters leads to equivalent iterative 
equations, non-linear transformations in general do not, and possibly 
the speed of convergence would be affected by the particular specifi- 
cation of the parameters. 

A natural extension of the generalized logistic equation as defined 
here is the five-parameter equation dW/dt = aW* — BW” discussed by 
von Bertalanffy [1957]. The estimation of the parameters of this 
equation, subject to the same assumptions as those considered above, 
is greatly complicated by the lack of any general explicit solution of 
the equation. Consequently expected values would have to be computed 
by numerical integration which makes the labour involved excessive 
except to those having electronic computers available. A general 
method for the fitting of differential equations has been given by Box 
[1956], and this could be applied to the general equation. However 
we may expect that the introduction of the fifth parameter would 
entail the existence of very extensive data if reasonably accurate esti- 
mates of the parameters were to be obtained. The data discussed above 
cover a 380-fold range of weights, and the standard error of a single 
weight was about 5.5 percent, yet the s.e. of 6 was as high as +0.64. 
The introduction of the fifth parameter might well require a much 
larger range of weights to give reasonably accurate estimates. One 
consequence of this situation is that it becomes difficult to test models 
based on a priori values of the exponents £ and 7, unless very extensive 
data are available, since the fits of curves having é and 7 differing quite 
widely from the a priori values would be hardly distinguishable. 


SUMMARY 
The least-squares fit of the curves defined by 


dw %, W 1/6 
die wt i. (*) | 
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is derived for the case when the sample values of In W are independent, 
unbiased, and of constant variance. 

Tables are provided to assist the computing of the iterative process 
used for estimation, and a method is given for obtaining starting values 
for the parameters. 

Evaluation of the sampling variances of the estimates of the param- 
eters and the generalized information shows that the range which the 
observations cover is more important than the density of sample points 
within that range if minimum sampling variance or maximum infor- 
mation per point is required. The introduction of @ as an unknown 
increases the sampling variances of the other parameters, the increase 
being especially marked when the range of (W/A)™’ is small. 

Some possible sources and consequences of deviations from the 
assumptions underlying the fitting are discussed, and also the diffi- 
culties involved in extending the fitting to the curves defined by 


dW /dt = aW* — pw’. 


It is stressed that sensitive tests of hypotheses involving a priori 
values of parameters may demand very extensive data. 
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COMBINED ANALYSIS OF BALANCED 
INCOMPLETE BLOCK DESIGNS 
WITH SOME COMMON TREATMENTS 


M. V. PAavaTEe 


Central Tobacco Research Institute 
Rajahmundry, India 


INTRODUCTION 


On many occasions, research workers have to lay out their experi- 
ments at different places with one or more treatments common to 
each experiment. The reason may be the lack of sufficient space at 
one location as in field experiments, or as in industrial problems the 
availability of only a limited number of experimental units with stipu- 
lated conditions with the result that the experiment has to be laid out 
at different plants. In either case, the research workers are interested 
in the combined analysis of the data from all of the experiments, besides 
their individual analyses. The purpose of the present paper is to sug- 
gest a simplified method to obtain adjusted treatment components for 
the combined analysis when the individual experiments are laid out 
in balanced incomplete block (BIB) designs. Gomes and Guimares 
[1958] have considered the case when the individual experiments are 
laid out in randomised complete block designs. The case dealt with 
by Gomes and Guimares could be deduced as a special case from general 
method suggested in this paper. 


NATURE OF THE PROBLEM 


Suppose there are g BIB experiments to be analysed jointly with 
the following parameters: 


v = number of treatments, 

= number of blocks, 

= number of replications for each treatment, 

= number of units per block, 

= number of times any two treatments occur together in the 


design. 


a a 


Let us suppose that, for each experiment, there are c treatments common. 
Hence, there are c + g(v — c) different treatments in all. Let us, 
following Gomes and Guimares, call the c treatments (common to all 
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experiments) ‘common’ treatments and the rest, g(v — c) treatments, 
‘regular’ treatments. Hence, the combined experiment will have for 
its parameters 


v’ =c+gv—c), b’ = gb, ki ke 
r’ = gr for the common treatments, 
r for the regular treatments, 
= gi for two common treatments, 
\ for a common and a regular treatment from the same 
experiment, 
0 for two regular treatments from two different experiments. 


ae 
| 


As one can see, the design is no longer a balanced one, but still possesses 
the property of connectedness. 


METHODS OF OBTAINING ADJUSTED TREATMENT 
SUM OF SQUARES 


We proceed to find the adjusted treatment sum of squares for the 
combined analysis by two methods. The general method suggested 
by C. R. Rao [1947] treats the combined data as that of a new incom- 
plete block experiment. Here we will only outline this method since 
it is well known in the statistical literature. The simplified method, 
which the author would like to introduce, makes use of the individual 
analyses already performed and this will be given later, along with 
the results for various special cases. 


Classical Method 


Here, the whole problem of combined analysis reduces to the solution 
of the adjusted normal equations 


Ct = Q (1) 


where the matrix C (v’ X v’) and the column vectors t Oe aa WOM < th) 
have usual meanings. The elements of the C matrix are 


Ci; = g\(v — 1)/k for common treatments, 


C3 = —dAg/k for any two common treatments, 
Cs; = Av — 1)/k for regular treatments, 
Ci; = —A/k for any two regular treatments from the same 
experiment, 
G; = 0 for any two regular treatments from different 
experiments. 


Here the rank of the C matrix is v’ — 1 and hence we adopt a suitable 
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restriction, so that it becomes non-singular and easy to invert. We 
use the restriction 
N yy g(v—c) 
pt a te =. 
t=c+1 
By using this restriction, the C matrix reduces to 


Erg I, 0 
r d 
eh 1, x (Z 1 — *}) 


where I is the identity matrix, 0 is a matrix having only zero elements, 
J is a matrix having only unity elements, X indicates the direct product 
and F# is the efficiency factor of the BIB design; the dimensions of the 
matrices are 

ieee. I,=v—-—c)X wo), 

1 ie te as J; = gv —c¢) XG, (2) 

Jo=(v-—c) XW-o), 0=cX gv—o). 
This matrix has a pattern discussed by Roy in one of his papers (3) 
and hence the inverse can be easily found. The inverse turns out to be 


al 
Erg 


I, 0 


1 1 1 
gEre fi bx (4 ve Ere J) | 
where I, , I, , I; , Ji , Je , 0 are of the same dimensions as in (2). 
Hence, the solution of (1) turns out to be 


i= CO) (3) 
Finally, the adjusted treatment sum of squares is t/Q where t’ (1 X 0’), 


a row vector. Total and Blocks (unadj) sums of squares are found 
in usual manner. 


Simplified method: 

This method is useful to those research workers who are not con- 
versant with matrix algebra and advanced mathematical topics. The 
method suggested here is based on individual analyses of g BIB ex- 


periments. 
Suppose we have performed individual analyses of g BIB experi- 


ments with the same parameter values, viz. v, b, 7, k, \ and with c 
common treatments. 
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Let 


Q: 
Oo” 
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adjusted treatment total of the 7th common treatment in 
sth experiment, 


Rt aie 

sa L, RS oh Of 

>-2_, Q;,, = adjusted treatment total for 7th common 
treatment in combined analysis, 7 = 1, --- , ¢, 


= adjusted treatment total for jth regular treatment in sth 


experiment, j = 1,---,(@v—c),s=1,--:,g. 


From these values, we obtain adjusted treatment sum of squares as 


follows. 


Form the following two-way table, Table I. 


TABLE I 


Two-Way TABLE oF COMMON TREATMENTS AND EXPERIMENTS 


WITH Q;,, VALUES 


c. treat 
Expts. ty tg a be Total 
E, Qin Qo1 see Daa Q. au 
B, Qi.s Qo2 Ast Qe,» Q. 2 
Ey On, Qe, 9 cee VoG OF 
Total Qu Q2 ae Q. Q.. 
Here 
Q a Sy Qi. ? 
t=1 
g 
Q; = Q;.: ’ 
s=1 
and 
c g 
Oe = > are 


Obtain (common treatments X experiments) interaction in the usual 
way. Then, 
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the adjusted treatment sum of squares for the combined analysis is 
Adj. Treat. S.S. (Combined Analysis) = Sum of Adj. Treat. S.S. 
from each exp. 


—1/Er (Common Treat. X Exp. Interaction) (4) 


ESTIMATION OF TREATMENT ESTIMATES AND VARIANCES 
OF TREATMENT DIFFERENCES FOR VARIOUS CASES 


Using the notation at the beginning of this section, we have 
e: a Q:/gEr, (5) 
i? = 2 | a Snare SOE (6) 
Er ares CUn ates 


Further note that, if ¢/‘° is the estimate of the jth regular treatment 
only using information from the sth experiment and if ¢/., has a similar 
meaning for the zth common treatment, then 


e it 
ee Pe Moat (7) 
g s=1 
SS ye eee a (8) 
C j=1 0) i 
or 
Soe ae . eS ee : > Piss (8a) 
i=1 wo 


Thus to find the best estimates, one need only estimate the treatments 
for each experiment seperately and use (7) and (8) or (8a) to obtain 
the final estimates. Formulae (8) and (8a) could be used with advantage 
when (v — c) < cand (v — c) > ¢, respectively. 

At this stage, it might be noted that the adjusted treatment sums 
of squares may also be obtained by the following alternative formula: 


Adjusted treatment S.S. = ss [ee nels) 2 (9) 
Tal An Em 


The variances of the various treatment differences are obtained as 
follows: 


(1) For treatment estimates #; , f;, belonging to the common group 
Vv; = Var oC se i) = 2s”/gEr. 


(2) For treatment estimates ¢; and #{*? where é; is from common group 
and @, is from regular group. 


. ee ale 1 Ee lel 
P= vo — 0) = fb fr 4 yt Lit, 
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(3) For treatment estimates @{° and i‘? where both of them belong 
to the regular group of the same experiment 
Ve ee EO Sy ae 


(4) For treatment estimates ¢{ and ¢;: where both of them belong 
to the regular group but are from different experiments 


~ A A , 2 1 
Esa (s) rt (3 ) ap fn | Pes 
Vs = Var (t; t; ) ri (1 = \), O 
s’ in all of the above formulae stands for the estimated error variance. 


SPECIAL CASES 


(a) There may be several BIB experiments with only one treatment 
common which might be a control. In this case c = 1. 


Let t, denote the control. Then we have the following results. 


I. Adjusted treatment S. S. 


Sum of the adjusted treatment sums of squares from individual 
analyses 


IL. Treatment estimates. 


A i £ 
ty = Se es ) 
g s=1 


A 


HO = oO — + : i, based on (8a). 


III. The variances of treatment differences. 
(1) This case does not arise 
(2) Wi =?) st rhe 
(3) VE St) = 2s rE: 
COG eet Oe ee Cop, 
The numbers here correspond to those in the preceding section. 


(b) There may be several BIB experiments with the same treatments. 
In this case c = v. In other words, we want to combine several 
BIB experiments with identical parameter values and treatments, 
conducted at different places. In this case, we have results as follow: 


COMBINED INCOMPLETE BLOCK DESIGNS Lhd, 


I. Adjusted treatments S.S. 


III. 


(1) 


(c) 


(d) 


The variances of treatment differences. 
Var (é; a 5s) — 2s" /rEg, 1 a", 1, v= ils SOME 
(2), (3), (4) These cases do not arise. 
The most important special case is obtained when b = r = X 
and k = »v. In this case BIB designs reduce to well known ran- 
domised complete block designs with c common treatments. The 


simplified analysis will remain the same with corresponding sub- 
stitution. 


Cases (a) and (b) can be considered for (c). 

Since (c) and (d) have been considered in detail by Gomes et al 
[1958], they will not be repeated here. However, we present the 
adjusted treatment sums of squares for the three situations since 
they are somewhat simpler than those given before. 


Case (1), ¢. = 1. 


Adjusted treatment 8. S. = Sum of treatment 8. 8. from the 
individual experiments. 


Case (ii), 1 <e¢ < v. 


Adjusted treatment S. S. 


I 


Sum of treatment 8. 8. from 
the individual experiments } — 


{Interaction (common treatment X 
experiments) using treatment totals. } 
Case (iii), c = v. 
a 2 2 
Treatment 8.S. = >> ge 
1 rg ~~ gor 


where 7’; is the total for the zth treatment over all experiments 
and G is the grand total for all experiments. 
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EXAMPLE 
(Hypothetical problem purely to illustrate the procedure) 
Two BIB experiments with the following parameter values are 


conducted and the combined analysis is required. ‘The parameter 
values are 


p= 7, b=7, 7 =4, b= 4) XS e2, g-= 2 and. (673; 
The following data are at our disposal. 


TABLE 2 
ANALYSIS OF VARIANCE FOR THE INDIVIDUAL EXPERIMENTS 


Experiment 1 Experiment 2 

Source Gli 8.8. d.f. S.S. 
Blocks (unadjusted) 6 342.71 6 63.21 
Treatments (adjusted ) 6 171.14 6 154.50 
Residual 15 165.11 15 469 .00 

Orn ats.00; (Pe 1.00, Oia = 18:75; 5 S450; 

Qe1 =a 7.50, OF a iti, Q2,2 = On tay Me = 0.50, 
Os4 = —6:75,. QOS == 8.75,- Cae = —9-15,, On = oo 
= 0.75. i = —5.00. 


Form, with Q;,, values, the following two-way table. 


; bi ty tz Total 
Ey 18.00 7.50 =O 75 18.75 
E, | 13.75 =H. 16 =) 0 So 
Total | 31.75 1.75 —16.50 17.00 


The (common treatment XX experiments) interaction is found to 
be 31.2708. 


Hence 


Treatment (adjusted) S. 8. 
for the combined analysis 


= 171.14 + 154.50 — 4/2 X 7 (31.2708) 
= 316.7055 
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The combined analysis is given below. 


TABLE 3 
CoMBINED ANALYSIS OF VARIANCE 


Source of variation d.f. 8.8. M.S. 

Blocks (unadjusted) is 425.375 32.7212 
Treatments (adjusted) 10 316.708 31.6708 
Residual 32 643 .042 20.0951 


With the use of the equations (7) and (8) we obtain the following 
treatment estimates for the combined analysis: 


i= A857. ts = 70:25005 i, = —2.3571, 
Eo = 120619, og = 8 761) 76D. 
iy = —0.7619, fe 2.2619, is = | 3.9762, 
Hor = — 13610! 7 = —0.4n0e 
Variances of treatment differences are 
V, = 2s°/Erg = 5.741, 
V2 = is E + ; + ; - | = 9.569, 
V; = 2s°/Er = 22.966, 
V.= Z (1 + Ne = 15.310. 
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GENERALIZED ASYMPTOTIC REGRESSION AND 
NON-LINEAR PATH ANALYSIS" 


Matcotm E. Turner, Ropert J. Monron anp Henry L. Lucas, JR. 


Department of Experimental Statistics, 
North Carolina State College, Raleigh, North Carolina, U.S.A. 


1. INTRODUCTION 


It is a fundamental fact of statistical inference that the information 
contained in an analysis of experimental data is the sum of the a priori 
information built into the statistical model employed and the a posteriori 
information contained in the data itself. For this reason the bio- | 
metrician must be concerned not only with the efficiency of his estimation 
procedures but also with the adequacy of his descriptive model. Much 
care must be taken to ensure that all relevant a priori information is 
utilized in the construction of the model. In some cases it is possible 
to derive rather sophisticated theoretical models on the basis of acquired 
knowledge and intelligent hypothesizing. These models are often con- 
veniently found as solutions of differential equations. In other cases 
little more may be known than that the biological process in question 
is continuous. In this latter case one may resort to polynomial models 
where the degree of the polynomial is either found empirically or by 
prior consideration of the number of ‘‘bends” which one can reasonably 
assume to take place in the process being studied. In certain other 
cases it may be known that the process approaches some asymptotic 
value. This is especially true in those cases known as ‘‘growth” pro- 
cesses. The general ineptness of polynomial models for purposes of 
describing such asymptotic situations has been repeatedly pointed out, 
although polynomials in the reciprocals may sometimes be used con- 
veniently. 

Stevens [1951] and Pimentel-Gomes [1953], writing in this journal, 
have discussed inferential methods related to one form of transcendental 
asymptotic model, the so-called exponential model, 
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by a grant from the General Education Board for the purpose of stimulating basic methodological 
research, 
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y=at Be" +, (1) 


where x and y are observables; a, 8 and y are unknown parameters to 
be estimated; and ¢ is a random error. The above mentioned writers 
give convenient computational procedures for finding maximum likeli- 
hood estimates when ¢ has a normal distribution with constant variance 
and the errors are uncorrelated one with another. Even when the 
errors are non-normal these same methods provide least square esti- 
mates of the parameters. The numerical devices used for finding the 
estimates are based on methods devised by Gauss and described in 
detail in the books by Whitaker and Robinson [1944] and Deming 
[1943]. The method of estimation, maximum likelihood, is due originally 
to Gauss and is discussed extensively by Fisher [1956] who is mainly 
responsible for our knowledge of the properties of the method. Both 
the numerical devices and the estimation method are reviewed in the 
recent work by Williams [1959]. 

Asymptotic models fall conveniently into either the class of trans- 
cendental models, including the exponential, or the class of rational 
models, including models such as the rectangular hyperbola and the 
inverse square law. 

This paper considers the general class of rational models and a 
special sub-class of transcendental models that may be regarded as 
a generalization of the exponential model. Appropriate techniques for 
estimation and for determination of asymptotic confidence limits, 
similar to those given by Stevens for the exponential, are provided. 
An alternative “‘parabolic method” of constructing confidence limits, 
valid for small samples, is also given. 

In many practical biological situations a number of variables are 
related one with another in complex causal networks. Wright [1918, 
1921] devised methods, which he termed path analysis, for analysizing 
such networks when the relationships between the variables are linear. 
Turner and Stevens [1959] have reviewed these methods for the im- 
portant case when some of the primary variables may be “fixed” or 
without probability distributions. The present paper considers some 
aspects of such situations when the relationships between variables are 
of the transcendental variety rather than linear. 

All of the models considered in this paper are non-linear in some of 
the unknown parameters. This implies that sufficient estimates and 
hence fiducial limits are not available. 

In addition the maximum likelihood estimates, though consistent 
and efficient, will be generally biassed, will not have normal distributions 
for small samples and will generally not have explicit forms, thus 
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requiring iterative solution methods. This is the price one has to pay 
for more adequate symbolic description of the process being studied. 
The gain in meaningfullness is generally well worth even this heavy toll. 


2. RATIONAL REGRESSION 


Let us denote a particular p-th degree polynomial by P,(x) and a 
particular g-th degree polynomial by Q,(x) and then let us consider 
the regression model arising from the rational function, 


y = [P,(a)/Q.(@)] + «, (2) 


where again x and y are observables and e¢ is a random error assumed 
to have a normal distribution with zero expectation and constant 
variance o. The errors are also assumed to be uncorrelated one with 
another and with the values of x. At least some of the coefficients in 
the polynomials are unknown parameters to be estimated. 

The numerical device for estimation due to Gauss, which was alluded 
to in the previous section, involves expansion of y — e about “trial 
values” of the unknown parameters in a Taylor’s series. The method 
is sketched in Section 4 and is there applied to the models discussed 
in Section 3. However, a simpler approach is available for the rational 
model (2). 

Suppose that Q,(x7) = 1+ Bix + --- + 6,2’, so that (2) may be 
written: y = [P,(2)/(1-+ Bia + --- + 6,2°)] + « If we multiply 
through by Q,(x) and rearrange we get 


Y= Pa) pity — ee a Ce (3) 


Now (8) is an ordinary multiple linear regression model except for the 
factor Q in the error term. Thus weighted least squares could be used 
for estimating the unknown parameters if the weights Q~? were known. 
A simple plan is to take trial values for the parameters 6; , --- , B, 
and compute provisional weights. Then (38) is fitted by least squares 
giving better estimates of the 6’s as well as provisional estimates for 
a, %,*'* , @ occurring in P,(x). Using the improved estimates 
for the 6’s, better weights are determined and the fitting is repeated. - 
The process is thus iterated through several cycles until stable values 
of the estimates are obtained. In many cases it is sufficient to use 
constant weights (Q°* = 1) in the first cycle of iteration, thus not 
requiring trial values. 

Asymptotic variances and confidence limits are found in the usual 
way from the final least square fit. Thus if for n observations we have 
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2 2 
Loy 1 9s OT Ry By, »** aly; 
2 2 
et 1 %_ % +: 23 LoYo L2Yo °** L2Yy 
-_— ef sive, ere eee eee ? 
~ 2 2 
HGR es ats om) ah) eg 


Ba (a Xai Sees ay By as Bs 


[@:° 0 0 
Ww OP nares 0 
Bos Oe 
then the final estimates after iterating are 
B = (X’WX) 'X’WY (4) 
and the asymptotic variances and covariances are given by 
Cov (B) = (X’WX)"'s’, (5) 
where s”, the estimate of o”, is provided by 
s = (YWY — BX WY)/n — p — ¢ — 2 (6) 


If the error in (2) depends upon z in a known way, only a slight 
modification of the procedure is required. Suppose that e, = f(x)-e. 
Then the error term in (3) will be fQe and the weight matrix W will 
be diagonal with elements (f;Q;)~? instead of (Q;)"?. Formulas (4), 
(5), and (6) will remain the same. 

Computational details for those not familiar with linear regression 
methods are given by Williams [1959]. Let the elements of (X/WX)~’ 
be denoted by c¢;; ; then the individual asymptotic confidence limits 
for each coefficient are 


Oe EN be oe, tN 8 4=0,1,-:-,p, (7) 


Bout N/ ConoBy eB, ul cns bie Vili l (0; 


where ¢, is a Student’s ¢ at an assigned confidence level (1 — 7) with 
n — p — q — 1 degrees of freedom. 

As an example consider the case of p = 0 and q = 1, the case of 
the rectangular hyperbola. We have 


y = [ao/(1 + 62)] + « (8) 


with asymptotes x = —1/8, and y = 0. Corresponding to (3) we have 
the regression equation y = a — Biry + Qe, where Q = 1 + fie. 
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A graphical estimate of the slope —6, can be obtained by plotting y 
against zy and provisional weights computed from w = (lL -EBiz)e. 
Then using these provisional weights we compute 


A wv vey — Dvyw Dw _ dw Bay (9) 
Br by ya W My wx y a Os wary) ye W Soeny 


and obtain better estimates of the weights. After stability is reached 
we find 


a = eS wy + By 3. way)/ >> W (10) 
and the estimate of the variance o’, 


: 1 o2 w >, wy — (vy)? — Serv \ (11) 


Ss — 
n— 2 Ww Sau 


We also obtain estimates of the asymptotic variances for a and b, : 


5 sided (>> wry)? } 
Sao Se aay + ee 

= i CDS mss 
a ae S)/Se 


from which asymptotic confidence limits may be calculated. 


(12) 


3. TRANSCENDENTAL REGRESSION: 
THE SINGLE PROCESS LAW 


The rational models discussed in the previous section may often 
be used to conveniently describe an asymptotic process. However, 
when more than just a few terms in the polynomials are required for 
adequate description of the process, the rational models may suffer 
from three deficiencies: (1) physical or biological meaning of the coeffi- 
cients may be lost, (2) annoying ‘“‘bumps” tend to appear in the fitted 
curve which have no reality in the actual process, and (8) excessive 
numerical computation may be required due to the number of para- 
meters which need to be estimated. 

Certain transcendental models may provide a more parsimonious 
description of an asymptotic process and yet preserve considerable 
flexibility of form. By ‘“‘parsimony” is meant the ability to describe 
adequately with a small number of parameters. This may be accom- 
plished by supplying the a priori information that the curve is regular 
in some way or another. For example, we may specify that the curve 
is monotonically increasing or decreasing throughout its course or that 
certain of the derivatives have this monotonicity property. We may 
then avoid the undesired “bumps” and make possible simpler com- 
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putations by using this information. In addition, since solutions of 
most differential equations are in terms of transcendental functions, 
we may wish to seek a model which is related to a class of differential 
equations, thus making interpretation of the parameters more meaning- 
ful. 

One simple class of transcendental regression models has been called 
by Turner [1959] the single process law. Consider the differential 
equation 


dn/d& = 6(y — a)/(E — ¥ 4). (13) 
Integrating we get 

nw rye8) 14) 
oF BE a) = as. 


Now let us suppose that the observables x and y are defined by the 
error equations x = £,and y = 7 + e, where « is a random error defined 
as before. Thus the model corresponding to the single process law (14) is 


y=at Bir —y7 d+. (15) 


I 


or n 


For convenience we will always take x — yé6 > 0. 

The proportionality constant 6 in (13) determines the nature of 
the process. When 6 is a positive integer, we have a polynomial process; 
however, the model (15) is non-linear when this integer is greater than 
two. When 6 is a negative integer, then we have a rational process, 
a special case of the processes studied in the previous section. When 
6 is non-integral, the process is transcendental. In particular, when 6 
approaches infinity, either positively or negatively, the model (15) ° 
approaches the exponential model (1), y = a + Be-”” + «. 

An interesting series of models, most of which are commonly used 
in curve fitting, is obtained when 2/6 = —4, —3, —2, —1, 0, +1, +2. 
These cases correspond to the inverse square root law, the inverse two- 
thirds law, the rectangular hyperbola, the inverse square law, the ex- 
ponential, the parabola, and the straight line, respectively. Thus all 
of these are special cases of a continuously varying family of curves 
(See Figure 1). We observe that when 2/6 is negative the curves have 
two asymptotes, that when 2/6 = 0 there is but one asymptote, and 
that when 2/6 is positive there is no asymptote. Thus the exponential 
may be thought of as a ‘‘transition” curve between those curves possess- 
ing two asymptotes and those possessing none. Without loss of gener- 
ality we may consider only the case of decreasing curves, commonly 
referred to as “extinction curves”. Several useful constants for the 
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INVERSE SQUARE - ROOT 


HYPERBOLA 
INVERSE“SQUARE_LAW__, 


EXPONENTIAL 


FIGURE 1 
SPECIAL EXTINCTION CURVES 


special cases enumerated above are summarized in Table 1. In this 
table the minimum level is taken in all cases to be a and the initial 
level is measured from this minimum value. 

Table 1 includes the three quarters life and the half-life. The ratio 
of the two provides a convenient means of estimating 6 graphically. 
From equation (14) we determine the initial value of 7 to be m = 
a + B(—y6)’ and the minimum value of 7 to be qin = a. The value 
of — corresponding to (m — a)/2 is the half-life and is given by £55 = 
vyé(1 — 27”’). Similarly we obtain the three-quarters life &;;=yd(1—4°”°). 
The ratio is then \ = £75/Es = (1 — 4.”°)/(1 — 2°”), or, inverting, 
we have 


6’ = —log, (A — 1). (16) 
The parameter \ is given for each of the special cases in Table 1. and 


the relationship (16) is charted for all cases in Figure 2. 
A more general criterion based upon any two percentile-lives is 


readily found to be 
NOP, P) = —n/e, = (= Gi )/1 — a"), (17) 


where 7, +@% =P2rt+@=1 
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RELATIONSHIP BETWEEN THE \ CRITERON AND THE EXPONENTIAL PARAMETER 6 


4, THE GAUSSIAN ITERANT 


Gauss appears to have been the first to use the Newton-Raphson 
technique for purposes of finding least squares estimates of non-linear 
parameters. See Whitaker and Robinson [1944]. In the case of regres- 
sion with normally distributed errors, Gauss’ technique is equivalent 
to the important method of scoring due to Fisher. See Fisher’s recent 
discussion [1956] of the subject. 

The technique may be conveniently described in terms of the total 
derivative. Let us suppose that we have the general regression model 


y = f(G,z7) +e 


where 6 is a vector of unknown parameters. The fitted “prediction” 
equation is, in terms of estimates b, 


By the rule for the total derivative we get dg = 9, db, + 9.db. + ::-, 
where 7; = 09/0b; . Then to a first approximation 
Ag =F Gr0 Ab, te Goo Ab» ae AS tr) 


where Ag = y — yo, Ab, = b, — diy, Abo = be — bag, -:: . The zero 
subscript indicates that trial values for the undetermined estimates 
have been inserted. Thus, we get 7 = G + Abi Gio + AboGoo + +: 
and finally 


) 


= Yo te Absiio ae AbsG20 “per she. (19) 


The symbol e represents the estimate of the error e. Now (19) is an 
ordinary multiple linear regression equation and estimates may be 
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found as before. From the linear regression estimates may be found 
improved values for the original non-linear regression estimates. Thus, 
the process is iterated finding successive improvements. It may be 
seen from (19) that trial values for estimates which are linear in (18) 
are not needed to prime the iterative process. This point was noted 
by Stevens [1951] and will be illustrated in the next section. Refer to 
Deming [1943] for a fuller discussion of this technique of estimation. 


5. MAXIMUM LIKELIHOOD ESTIMATION AND ASYMPTOTIC 
CONFIDENCE LIMITS FOR THE SINGLE PROCESS LAW 


Applying the method of the previous section to the single process 
law we obtain the linearized regression equation: 


y=at be — m)” — bd (x — m)*'(m — m) 


+ b(a — m)* In (ev — m)(d — do) +e, 


(20) 


where dy , bo , mo , and dy are trial values and m = cd. Note that (20) 

is linear in a and 6b and the corrections (m — m,.) and (d — d)). Thus 

provisional estimates of a and b and improved estimates of m and d 

can be found by fitting (20) by least squares. Iteration provides the 

maximum likelihood estimates. We notice that trial values of the linear 

coefficients a and 6 have been eliminated and are thus not required. 
For convenience we write 


Bo — a, Lo = dks 

By =)0, Z, = («& — m)”, (21) 
iB — —b dj(m oe Mo) , Z» aa (x om men, 

1BY. =e b(d = do); Lp = (x ie Mo)” In G: a Mo). 


Then (20) may be written 
y = By a BZ; ae BZ, S- BaZa +4 Cr 


We have then to find estimates of B, , B, , B, , and B; . These are 
found by multiple regression methods as follows. Let 


a Zi Liz Z13 Bo Yi 
Vie 1 Zor Zoe Zo ; ie B, ars Yo 
B, 
= 1 bee in Woes Bas Yn a 
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The normal equations are Z’ZB = Z’Y and, if Z’Z is non-singular, 
the vector of improved estimates is given by 
B = (Z'Z)"Z’Y. (22) 


Whenever the error depends upon x in a known way, (22) must be 
replaced by B = (Z’WZ)7'Z’WY where W is a diagonal matrix of 
weights as in Section 2. From the results of (22) we obtain improved 
values of m and d by using the relations (21). Thus 


m= Mo ae B,/doBy ) 
d= d+ B/B, . 


(23) 


I 


These improved values replace m, and dy in (21) and the solution (22) 
is found again, giving still better values. The process is iterated until 
constant values of m and d are found to the desired degree of precision. 
After stable values are obtained we have 


Ci =sDa Oath Cn 0 (24) 
and the estimated curve is given by 
§ =a+t be — m)’. (25) 


If » = y6 is known a priori, then the third row and column of Z'Z, the 
third element of B and the third element of Z’Y are deleted. Other- 
wise, the iterative process is the same as before. Application of this 
case to a biomathematical model of running records has been made 
by Turner [1959]. Similarly, when 6 is known a priori, the fourth row 
and column of Z’Z and the third elements of B and Z’Y are deleted. 
If, as is often the case in extinction curves, the minimum value a is 
known a priori then y is replaced by y’ = y — oand the first row and 
column of Z’Z and the first elements of B and Z’Y are deleted. Thus, 
we see that one estimation procedure suffices for the general problem 
and several special cases as well. However, in the case that 6 is a 
known integer, we may prefer to use the methods of Section Zoi ES 
is known to be infinite, the methods of Stevens and Pimentel-Gomes 
referred to earlier should be used. 

By examination of the definition of the Z’s, equations (21), we 
see that Z’Z is singular if 6 = 0, 1, or ko. Therefore, the method 
of this section may be expected to break down in the neighborhood of 
these points. Trial and error minimization of the residual sum of 
squares, S= > fy — a — b(« — m)*]’, may then be resorted to, perhaps 
utilizing some interpolation method such as that proposed by Will [1936]. 
Values of m and d may be selected and S minimized by ordinary straight 
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line regression techniques. Thus for the selected values m, and d, 
we may regress y on X = (x — m,)”, obtaining the minimum S for 
these values: S. = 2) (y — 9)” — {1 Xy — HP/D (X — &)}. 
Similarly, values S;. for m, and d, , Sez for mz and d, , et cetera, may 
be found and the minimum located. 

Alternatively, we may choose selected values of 6 only (or of pu 
only) and use the deleted normal equations to minimize with respect 
to a, 8, and u (ora, 8, and 6). Then simple one-dimensional interpolation 
will locate the minimum value of S. 

When Z’Z is only moderately ill-conditioned, we may improve con- 
vergence by damping oscillations. Wegstein [1958] recommends using 
in the + 1-th iteration (in place of m; and d;) the modified values 


mr = um;-1 +m; , 
d* — Ue Chea + od; . 


The following data taken from a sampling experiment [Turner, 1959] 
will illustrate the iterative process. 


(26) 


8 


y 


162.47 
108.10 
$1.12 
66.11 
50.06 
49.37 
39.72 
33.81 
30.90 
30.73 


SOON OOH WH HH 


fem 


Trial values m) = —1.00 and d, = —1.00 were tried and the results 
below were obtained for successive iterations. 


Iteration m d 


rwWd FH © 
| 
(JX) 
(ee) 
| 
lop) 
ive} 
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Asymptotic confidence limits are easily found. After the final 
iteration the asymptotically unbiassed estimate of o is given by 


“a O25 BG aie D Ay 
—= fe SB, Zoy — Bs ys Zay)/(m — 4). (27) 
Let us denote by c;; the element in the 7th row and jth column of 
(Z’Z)~'. It has been well known since Gauss that the variance o; 
of any function of the B’s (f say) is asymptotically given by 
3 3 
=o *s, 2a Cuil ) 


where f; = Of/dB; and f; = of/0B; . We may use as the estimated 
variance of the function the quantity 


3 3 
sj zg, 2 Do Coifihi . (28) 


Then from (23) and (24) using (28) we find the estimated asymptotic 
variances for a, b, m, d, and c to be 


S-= Cos).  S, = Gus [0 a, 
3 = Cus, S58 70 , (29) 
8. = [(o2/b’ d’m) — (2c23/b" d°) + (m’e,3/b” d’)]s’. 


Finally, for y, using (25) and (28), we find the asymptotic variance 
= 3" : 2 Coie (30) 
From (29) and (30) we can construct asymptotic confidence limits. 
We get 
8a SOS CE Paso, 
— 18 58 Sb + £8, , 
tS, os St tS 
—t8:5 56Sd+t,8, 
€ ahaa Steer cleat eee 
g — 18; 5 EY) S 9+ 4,8; , 


with (1 — 7) confidence each where t, is Student’s ¢ for the z level 
of significance with n — 4 degrees of freedom. ‘The last confidence 
statement provides confidence limits for the true value of 7 = E(y) 


(31) 


a = os 
| 
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fora givenx. See Deming [1943] for detailed discussion of these methods 
for any model. It must be emphasized that (31) provides individual 
confidence statements at the assigned (1 — 7) level of confidence. 
Of course, the joint confidence level is much less than (1 — 7). 

These confidence statements provide a ready guide to the tenability 
of various hypotheses about the unknown parameters in the single 
process law. If it is specifically hypothesized on theoretical grounds 
that some particular parameter takes a given value (e.g., Hy : 6 = 6), 
then we may test the concordance of the data with this hypothesis 
by seeing whether or not the hypothesized value lies within the con- 
fidence interval. If the hypothesized value is in fact tenable, we may 
wish to refit the regression equation after making the proper deletions 
for a known a priori parameter. On the other hand, if there is no 
theoretical reason for making such a hypothesis, it does not seem wise 
to recompute the curve for some convenient value of the parameter 
lying within the confidence interval. Such practice will lead to a false 
sense of confidence in that narrower confidence limits will result from 
fixing the parameter. 


6. PARABOLIC CONFIDENCE LIMITS FOR SMALL SAMPLES 


The asymptotic confidence limits described in the foregoing section 
may be expected to give adequate results for large samples or even 
for quite modest samples if the error rate is small. For example, in 
a recent sampling investigation [Turner, 1959], it was found that, 
with a sample size of ten equally spaced points covering the major 
portion of the curve and with a coefficient of variation of two percent 
at the center of the curve, the sampling variation of d, the estimate of 6, 
was quite accurately described by the asymptotic distribution. How- 
ever, when the error rate is much larger, it may be supposed that the 
asymptotic distribution will not suffice for construction of tests or 
confidence limits. 

We will now describe a method of constructing an exact test and 
corresponding confidence limits based upon a class of “parabolic” 
alternative hypotheses. 

We consider the null hypothesis 


Ho ie — Mm = 6— & = 0. (32) 
Thus, when the null hypothesis H) is true, we have the model 
Yee ar B(x; ae Hos: + ¢€, (33) 


where 7 = 1, 2, --- , n for sample size n. 
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Let Zi0 = (@; = pa) then the model (33) for H, true is written 
Yi =a + 62.0 16: 5 


the model for a straight line. 
Now, if H, is not true, we could write 


y; = f(Zio) +e - (34) 


Expanding (34) in a Maclaurin series and preserving terms through 
the quadratic, we get 


y; = Bo + BiZio + BoZio + «&; . 


If we suppose that coefficients of terms beyond the quadratic are 
negligible, then the null hypothesis, 


(HERS 3p (0), 
is equivalent to (32). An exact test of Hj is well known and is easily 
performed. 
Let 


Sue en Oana 
Sin = D0 Zio — [CLE Zio) DE Zic)/n, 
Sy = LD Zoe — (CL ZX yd/n, (35) 
Su = Dy Zin — |e Zia) a 
Soy = DY Zioys — (D0 Zi) De ys) /n), 
Sid =e a et 

where all summations are over? = 1,2,---,n. Then we have estimates 
B, = (82281, — Si2S82,)/(Sr1S22 — Ste), 
Be= (SuSa — Sessa (Sass ae 


Then the residual sum of squares about the fitted quadratic curve is 
given by 


(36) 


SSH = Mee ae Bs. ae Bess . (37) 
Similarly, the residual sum of squares about the straight line is given by 
SSH’ = Se = GS Se.) (388) 


Then an exact F-test for H{ : B, = 0, which is approximately equivalent 
to Hy: uw — bo = 6 — 6 = O, is performed by calculating 


F = (n — 3)(SSE’ — SSE)/SSE. (39) 
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Then if F > F, , the tabular value of F with 1 and n — 8 degrees of 
freedom, the null hypothesis is rejected at the x level of significance. 

A joint confidence region, with (1 — 7) confidence, for » and 6 
(and hence y and 6) may be found by determining those values of jp 
and 6) which ensure that F < F,. This may be done most conveniently, 
perhaps, by calculating F for various selected values of uo and 6) and 
then interpolating to find the confidence contours. A numerical example 
when yu is known a priori has been given by Turner [1959]. 

In some cases it may be necessary to retain higher-power terms 
in the Maclaurin expansion. Since then a wider class of alternative 
hypotheses would be admitted, the power of the test against any spe- 
cific hypothesis would in general be lower. 

When observations are replicated for various values of x, the numer- 
ator of (89) may be tested against the within replicate sum of squares 
to give a more powerful test. If the within replicate sum of squares 
is denoted SSE*, we have the test 


F = n(r — 1)(SSE’ — SSE)/SSE*, (40) 


with 1 and n(r — 1) degrees of freedom, where 7 is the number of points 
on the curve as before and 7 is the number of replicates. A confidence 
region may be found corresponding to this test just as before. 

A test similar in concept to the parabolic test has been proposed by 
Williams [1959] and is discussed in connection with the single process 
law by Turner [1959]. Williams’ test is not likely to be as powerful 
as the parabolic test whenever there is more than a single non-linear 
parameter, as in the present case. If either uw or 6 is known a priori 
then Williams’ test should be considered as an alternative to the para- 
bolic test. 

Hotelling [1939] has proposed a method of constructing exact tests 
which could be applied to the construction of a test of the hypothesis 
8 = Oin the single process law. However @ is the constant of integration 
and is generally of little interest since the character of the process is 
not determined by this constant. 


7. NON-LINEAR PATH REGRESSION 
AND THE SINGLE PROCESS LAW 


Let us suppose that we have p ‘‘secondary” variables which are 
causally related to one another and to a set of g “primary” or “‘control- 
lable” variables. We imagine that the gq primary variables (denoted 
£,, & , °°: , £,) have values which can be chosen at will by the experi- 
menter. In particular, it is possible that they are randomly chosen 
but this is generally not so in laboratory experimentation. The p 
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secondary variables (denoted m , m2, *** , Mp) are then supposed to 
be completely determined. The rates of change of one variable with 
respect to another are determined by a matrix of partial derivatives 


Nia Na Npa 
Nib = Neb Nob 
Nia N2a *** Noa i) 
H’ = = whe (41) 
Uppte UP | OF ha a! 
M2 N22 Np2 
L ip Nop. °°° Nop 
When 7:; = a; , a constant, then the relations between all of the 


variables are linear. A recent review of linear path regression analysis 
is that of Turner and Stevens [1959]. See also Turner [1959]. The 
method of path regression analysis, originally described and named 
by Wright [1921], has been applied only in the case of linear relations, 
with one exception [Tukey, 1954]. The method, however, only attains 
its full power when the relations between variables are described by 
non-linear equations. This is partly what Tukey meant when in his 
critique [Tukey, 1954] he said that classical path coefficients are ‘‘ very 
good, but they don’t go far enough”. For example, the important 
field of chemical reaction kinetics requires a path analysis wherein the 
relations between variables are governed by the law of mass action. 

In many cases we may regard 7;; to be a function of 7; and £; (or 7;) 
only. In particular, we may consider that the relations between varia- 
bles follow the single process law. 

Let a matrix of rate coefficients be 


Olan (Olgeet Ons 
Oxy dap oe Onp 

ae bes HOR eos | 
TGs gions A’ 
Oro Ons 
51> bop 1 =I 
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and diagonal matrices be 


fo 0 Hd, O 
De O &, Ce es Da 0 by °°? O 
LO & 0 O Ma 
| m 0 a, 0 0 
pa O 1, 0 De Oras 0 
Ihe (eo een iy eG) See 7, 


Now we will consider the multivariate analogue of the single process 
law given by 


, oe aml / a 
f= ee — ee D,) AWD, a (42) 
4 (D, a Di AD, ee De) 
The analogy in form to the univariate differential equation (13) is 
evident. A solution of (42) is 


m = oO + BiG. — Ma)’**Es — be)? + 
“(Eq — be)’**(1)(n2 — az)?** ++ > (ap — )?”” 
Na = 2 + Bo(Es — Ma)’ **(Es — oe)??? > 
“(Eq — Ba) "(m1 — 1)'7*(1) +++ (np — )'*” (48) 


Np = Oy + By(Es — Ha) (Es — oy)?” ++ 
-(E, ao Me)? "(m == ay)??* (no aa 3)” rae (1). 


Thus (43) are the “structural equations” for the causal network arising 
from allowing each pathway to individually follow the single process law. 

As with linear path regression analysis many of the pathways may 
be non-existent for any particular problem. These restrictions on the 
structural equations are made simply by setting the corresponding 6 
to zero. 

Note that the structural equations (43) relate ‘‘true” variables and 
parameters. No stochastic variables have been considered yet, and thus, 
we may transform the equations arbitrarily. We take logs as follows: 


n¥ = log (m — a), f= log ( —"1,); at = log B, , 
log (ne = Qe), &F = log (5. — ps), ay = log B , 


ne 


log 654 


log &, am Bale a 


np = log (np = 5); ee 
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and a* = 6,;. Then (48) becomes 
a¥ + ake + ake + +--+ + até + afnk + --: + atnd 


*% 
ll 


Mm 
na = a7 ager “bates te FP ones ae On nr t+ 26 a Osan (44) 
ny = a% + att - okt} + = 2 + ate ont ie ee 


These transformed structural equations are identical in form with 
those underlying linear path regression analysis. For a more detailed 
comparison see Turner [1959]. We may write (44) in matrix notation 
as follows: 


(A, ay] | - (45) 
n 
where 
ax 1a Aig 0 as at, 
(A, , A.) = ae ae pag aes : , oy ’ 
Eo ies 2 Cae Op Op 0 
and 


EO (Eee 2 ee): 
n’ = (ni n¥ +++ 0%). 


From (45) we get by multiplication 


A,E+ An=n (46) 
and 
n— Aon = AE 


(I — A,)n = AjE. 


If (I — A;) is non-singular, we get the reduced transformed structural 
equations. 


n= Ae Ae (47) 


Thus, the transformed secondary variables n are expressed as linear 
functions of the transformed primary variables ~ and it is seen that 
the necessary and sufficient condition for this to be possible is that 
(I — A,) be non-singular. 

Inverse log transformation will then provide reduced structural 
equations in terms of the original variables also such that each secondary 
variable is a function of just the primary variables. 
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There are several types of causal relationships which are con- 
veniently distinquished. First of all there is the situation in which 
a single secondary variable is determined by one or more primary 
variables. This is the case of ‘ordinary regression”, the single structural 
equation being the non-linear analogue of that corresponding to the 
usual multiple regression model. A second case is the situation in which 
two or more secondary variables are jointly determined by some or 
all of a common set of primary variables. We term this case the case 
of “joint” regression. In a third situation there is a chain of cause and 
effect leading from one or more primary variables to a secondary variable 
and then on to still another ‘‘secondary” variable. This case we term 
the case of “chain” regression. A final type involves cycles of causation 
or “feedback” from one secondary variable to another and back again 
after possibly passing through one or more other secondary variables. 
This is the case of “cyclic” regression. All of these cases individually 
or in combination have been synoptically treated in what has gone 
before. Here we wish only to note that the several cases are distinguished 
by having different kinds of (I — A;) matrices. For this reason we 
term (I — A,) the ‘‘classification’? matrix. It may be verified that 
regression type and (I — A,) are related as follows: 


regression type (I — A;) 
Ordinary 1 (scalar) 
Joint it 

Chain Triangular 
Cyclic (feedback) Non-triangular 


The heavy algebra implied by (47) may be conveniently by-passed 
by the use of algorithms (see Turner and Stevens, [1959]) when the 
classification matrix (I — A,) is triangular, that is, when there is no 
feedback in the system. General algorithms are also available. 

We now consider three elementary systems to illustrate the fore- 
going and to provide concreteness. 

Consider first, the ‘‘multiple regression” analogue, with the path 
diagram below. 


&a No 
— F 
en 


The structural equation is then found to be 


fe = CRI =e Bi (Eo = wa)??*(Es a TAS (48) 
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A second example involves ordinary, joint, and chain regression. 


a Nio n 


Mb | 


N2b 
9 1, 


The structural equations are 
mM = a Biléa a ba) GG cat lie) (49) 
Cae B2(&, a bs)? (m1 a Oy) 
Reduction by use of the algorithms leads to 
Mm = Oy = Bi (E. Gz, Ha) *(b Pa pee (50) 
a. = Oy “BoB (Eg — ey 


A final example is one involving feedback. The path diagram is as below. 


Ne 


We have the structural equations 


m = a+ BilE are Ma) *(m2 ps ays 


(51) 
Ne = a2 + Bol—, — us) **(m = CN 
Reduction by (47) then gives 
Mm = a 1. fs ieee soe rag 
: ae §1a/(1—612601) i 61260b/(1—51i2601) 
(é, i) (é, iy) ? (52) 


—_ 8a1/(1—612841) g1/(1—512601) 
No = Oy + By 12091 Bo 129021 


-(E — yp Nhe S hE Ce C8 a py ere 
a a. . 


Let us assume independent additive normal errors in the observable 
variable y and non-existent errors in the observable variable x so that we 
have x, = &; ford = 1,2, --- ,q, andy; = »; + ¢«;forz = 1, 2, --: , p. 
We further assume that the e; are uncorrelated from one variable to 
another. Then after reduction we will have p independent non-linear 
regression equations, each of the form 


] = By aie B, (a, na M.)”*(a, = M,)”” sey a, ra M,)”" (53) 


where DS, D, , +++ , D, are functions of the &’s and B, is a function of 
the 6’s and 4’s. 


Now precisely the same problem of “‘identification” which plagues 


REGRESSION AND PATH ANALYSIS 141 


linear path analysis arises again here. The problem of estimation is 
much simplified if we can confine our attention to finding the B’s and 
D’s. In certain cases it is then possible to solve for estimates of the 
original parameters in terms of just the B’s and D’s. When this is 
possible, we say the system is just-ddentified. See Turner and Stevens 
[1959] for discussion. In the present instance we will restrict con- 
sideration to the just-identified case. 

Again we make use of the Gaussian iterant. We first obtain trial 
estimates Mao, Myo, --+ , Mao, Dao, Dio, -** , Doo. Then we compute: 


Zio = (e2 ae May a: ei Mee: ee (a a May "5 
Z20 = VAY AC A — M .o), 
Z30 Z10/(o = M0), 


Zgs3.0 = Z10/(q = M wo); (54) 
Lage6 as Z10 log, (2, a. M 40), 
Vie = Zio log, (2, — Mo); 
oes PE = Z10 log, (@, — M wo); 
for all observational vectors. In addition we define: 


B, = —Byo DiwWAM, a M wo); Beis = Bio(D, a Die) 
B, —Byjo Dyo(M, as M0), Bass = B,(D, a Dyo), (55) 


Barr = —Byy Dwo(M, i M qo); Bog+1 ar B,(D, a Dw): 


Using the device of the total derivative we obtain the approximate 
linearization 


gy = By a3 BZ yo ae BZ 20 =e as of BoqiiZoqr1 (56) 
with which to replace (53). By the usual methods of linear regression 
we now obtain the estimates B, , B, , --: , Bog: and from (55) we 
get the improved estimates M, , --- , D, . These now can be used 


to replace M,o , --- , Dao in (54) and the entire process reiterated until 
stable estimates are obtained. 

If any subset of parameters is known a priori then, as in the simple 
regression case of Section 3, the appropriate Z’s are simply omitted 
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and the analysis carried out without them. For example, if it is known 
that all processes have vertical asymptotes zero, then we omit Zz 
through Z,41,0- 

When there is more than a single response variable, the application 
of (56) separately to each provides multiple estimates for the vertical 
asymptotes M, , M,,--- , M,. In order to provide unique, efficient 
estimates of these values, we fit, instead of (56), the family of hyperplanes 


OP = 1By ep + Be Linns = BZ 50: a pis Si Ba414o+1,0: 
ae Brea Gata si aces = Beas: (57) 


where 2 = 1, 2,--: , p and B, , B; , «-: , Bags, are common for all 2. 
Linear regression theory again provides the minimum variance un- 
biassed estimators for fitting (57). 

Now the final estimates of the B’s, M’s and D’s will not be minimum 
variance unbiassed estimates but they will be maximum likelihood 
estimates and hence they will be consistent, efficient and invariant 
under transformation. Thus, since we have assumed a condition of 
“‘just-identification” we can solve for estimates of the 6’s and these 
estimates will also be maximum likelihood estimates. 

If we were to have an under-identified system, then the methods 
of this section would be as satisfactory as any. However, if we were 
to have an over-identified system, another procedure would be re- 
quired. Perhaps it would be best in that case to simply determine the 
maximum likelihood estimators directly from the special case being 
considered. A general computing method appears to be too cumber- 
some for all possibilities of over-identification. 

Asymptotic tests and confidence limits for the just-identified case 
are found as before by applying linear results to the final iteration. 
The parabolic test is also applicable to the general case of this section, 
but, of course, the numerical difficulties are much increased. These 
may not be prohibitive in the simplest cases, however. 
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ON THE ANALYSIS OF SPLIT-PLOT EXPERIMENTS 


H. Leon Harter 


Aeronautical Research Laboratory, 
Wright-Patterson Air Force Base, Ohio, U.S.A. 


Summary. 


A crucial question in the analysis of split-plot experiments is whether 
or not the interaction between subplot treatments and replications 
should be pooled with the three-factor interaction of main-plot treat- 
ments, subplot treatments, and replications, the result being called sub- 
plot error. A brief history of the disagreement over this question is 
given, along with a rule for deciding, on the basis of a preliminary test 
of significance, whether or not to pool. Several numerical examples 
are cited, and one of these is worked out in detail. 


1. History. 


The simplest type of split-plot experiment is one in which a levels 
of one treatment (the main plot treatment A) are arranged in a random- 
ized blocks design with r blocks (replications R), and b levels of a second 
treatment (the subplot treatment B) are assigned at random, one to 
each of b subplots in each of the ar whole plots. It is assumed that a 
correlation p exists between the experimental errors for any two sub- 
plots in the same whole plot, but that experimental errors for subplots 
in different whole plots are uncorrelated. Historically, most statisti- 
cians have assumed that interactions between treatments and replica- 
tions do not really exist. Hence they have computed a sum of squares 
for subplot error without showing a breakdown into sums of squares 
forB X Rand A X B X R, and have used the mean square for subplot 
error for testing the significance of both B and A x B. Some (see, 
for example, Federer [5, p. 274]) have even asserted that B * R and 
A X B X RF are confounded with each other and hence are really not 
separable, though it is possible to compute the two interactions arith- 
metically. On the other hand, Anderson and Bancroft [1, p. 350] have 
opined that if the subplot treatments are considered to be random, it 
is necessary to separate B X R and A X B X R. Among those who 
have considered the case in which the interactions between treatments 
and replications are not assumed to be zero are Wilk and Kempthorne 
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(10, pp. 101-102]. Chew [8, pp. 45-49] has assumed only that the 
three-factor interaction is zero. In either of these cases, B X R and 
A X B X R must be separated, since the expected values of their mean 
Squares are not identical. We shall consider cases in which it is not 
known a priori whether or not the interactions between treatments 
and replications are zero. We shall assume that R (replications) is 
always a random factor, and consider four cases: (1) A and B both 
random; (2) A fixed and B random; (3) A random and B fixed; and 
(4) A and B both fixed. Of these, (4) seems to occur most often in 
practice. We shall define the variance of an effect involving a fixed 
factor having f levels with (f — 1) instead of f in the denominator. 
The expected mean squares are summarized in Table 1, along with the 
test ratios for A, B, and A X B. 


TABLE 1 


SUMMARY OF ExPECTED VALUES OF MEAN SQUARES AND Test Ratios 
FOR SPLIT-PLOT EXPERIMENTS 


Expected Value of Mean Square 


Mean (all factors random)* 

Source Squarau|iace to" ot wiko (aro wor. (oops a? 
Main-plot 

Treatments A Sa br b [r] fl] 1+6-—I1p 
Replications R S; ab (b) fa] [((1)] 1+6— 1p 
Au aely Sar b {1] 1+6—I1p 
Subplot 

treatments B Sp Gr UP) @ (1) l—p 
A XB Sap r 1 1 — p 
BXR Sor a by al 
IND 15S Yea te Sabr 1 1 — p 


*For brevity, we show multipliers of components of variance in the table; thus, for example, 
E(Sabr) = o%abr + (1 — p)o%. If A is fixed, omit components whose coefficients are enclosed in pa- 
rentheses. If B is fixed, omit components whose coefficients are enclosed in brackets. (It is assumed 
that R is always a random factor.) 


Test Ratios for A 
If B is fixed, use Sa/Sar (exact test). If B is random, use Sa/Sar (assumes oat? = 0) or Sa/Sap 
(assumes o2ar = 0, p = 0). Satterthwaite test: Sa/(Sar + Sav — Sabr); Cochran test: (Sa + Satr)/ 
(Sar + Sad). 
Test Ratios for B 
If A is fixed, use S1/Sor (exact test). If A is random, use S1/Sbr (assumes oat? = 0) or Sp/Sad (as- 
sumes o25r = 0). Satterthwaite test: St/(Sor + Sas — Sar); Cochran test: (S» + Sabr)/(Sor + Sad). 


Test Ratio for A X B 


Use Sat/Sabr (exact test) 
(One ig usually not interested in testing significance of R and its interactons.) 
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2. Decision Rule for Pooling. 


We find in Table 1 that the expected value of the mean square for 
the A X B X R interaction is o2,, + (1 — p)o. The expected value 
of the mean square for the B X R interaction is aa;, + 0%, + ( — po 
if A is random, but it is ao;, + (1 — p)o’ if A is fixed. In either case, 
the expected values of the mean squares for B X Rand for A X BX R 
are not identical; hence they should not be pooled unless this action is 
indicated by the results of a preliminary test of significance. This 
preliminary test is a test of the null hypothesis H, : E(S,,) = E(S.;). 
Since R is assumed to be random, both S,,/H(S,,) and Sasr/ Sarr) 
have central x’/d. f. distributions. Hence under H, the test ratio 
S,,/Sas, has a central F distribution. The form of this preliminary 
test depends upon whether A is random or fixed. If A is random, 
H, reduces to Hj: oj, = 0, the expected value of the mean square for 
B X Ris greater than or equal to the expected value of the mean square 
for A X B X R, and the required preliminary test of significance is 
a one-sided test on the ratio of the mean square for B X R to the mean 
square for A X B X R. If this ratio does not exceed a certain per- 
centage point of the F distribution, the two mean squares are pooled; 
otherwise, they are not pooled. This problem has been studied by 
Bozivich, Bancroft and Hartley [2], who recommend a preliminary 
test of size a, where .25 < a < .50. On the other hand, if A is fixed, 
H, reduces to H¢’ : o2,, = ao;, , nothing can be said about the relative 
magnitude of the expected mean squares for B X Rand A X B xX R, 
and the procedure recommended by Bozivich, Bancroft and Hartley 
must be modified slightly; the preliminary test of significance is a two- 
sided test of size a, with probability a/2 in each tail of the F distribution. 
In this case, it is not possible to make a test of size a = .25 without 
computing a special table, since no table of the upper 12.5 percent points 
of the F distribution is available. Tests of size a = .20 anda = .50 
are possible, using tables of the upper 10 percent and 25 percent points 
of the 7 distribution tabulated by Merrington and Thompson{7]. 


3. Hxamples. 


Five numerical examples have been taken from the literature and the 
subplot error for each has been subdivided into BX Rand A X BX R 
interactions. In each of these examples, it appears reasonable to assume 
that both A and B are fixed factors, so that the preliminary test of 
significance will be two-sided. Table 2 gives, for each example, the 
source of the data, the mean squares and the numbers of degrees of 
freedom for B X # and for A X B X R, the ratio of the larger mean 
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TABLE 2 
PRELIMINARY TxEstTs FOR Pooutne In Spiit-PLor ExpERIMENTS 


BEX ALB eR Critical 
Interaction Interaction Larger M. S. Values 
Source Ratio = 

Code MVIERS Seedt: IETS colts Smaller M.8. | Fo F.25 
(1) 0.0367 15 0.0236 30 1.56 7 We? 
(2) 15-29) 6 25.50 42 iL AGH Peis) athe) 
(3) 28.08 9 ish lites Ar ale ey By 
(4) AT9E2T 206.02 30 1 7} 1.87 1.40 
(5) 19.36 70 21.03 140 1.09 UBM. a AG 


Sources: (1) Snedecor [9, Table 11.35, p. 310], (2) Federer [5, Table X—2, p. 276], (3) Anderson 
and Bancroft[1, Table 23.2, p. 348], (4) Rider [8, Exercise 4, p. 191], taken from Yates [11], (5) Cochran 
and Cox [4, Table 7.5, p. 225]. 


square to the smaller and the upper 10 percent and 25 percent points 
of the F distribution. If we use a preliminary test of size a = .50, 
we conclude that B X Rand A X B X R should not be pooled for the 
data from sources (1), (3), and (4). Even if we take a = .20, we decide 
not to pool in the case of the data from source (8). Let us now look 
at this example in greater detail. The data represent yields (in bushels 
per acre) of corn in an experiment conducted to compare four methods 
of seedbed preparation and four methods of planting, using four blocks. 
The methods of seedbed preparation are the whole-plot treatments A, 
the planting methods are the subplot treatments B, and the blocks 
are the replications R. Since there are two fixed factors (A and B) 


TABLE 3 
ANALYSIS OF VARIANCE FOR DATA FROM SouRCE (3) ABOVE 


Source of Variation d.f. 5.5. M.S. F 


Seedbed methods (A) 
Replications (/?) 


194.56 64.85 64.85/17.58 = 3.69 
223.81 74.60 


AXR 158.25 17.58 
AXB 221.74 24.64 24.64/13.18 = 1.87 
BXRE 252.72 28.08 


3 
3 
9 
Planting methods (B) 3 | 4107.39 | 1369.13 | 1369.13/28.08 = 48.76** 
g 
9 
7 


AXBXE 2 395.75 13.18 


Total 63 5514.22 a 


**Significant at the one percent level. 
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and only one random factor (R), the fixed main effects and interaction 
are tested by their interactions with the random factor, as shown in 
Table 3. Anderson and Bancroft give the correct test for the whole- 
plot treatments A, but test the subplot treatments B and the inter- 
action A X B by the subplot error obtained by pooling BX R and 
A X BX R. In this case they reach the correct conclusions (B highly 
significant, A x B non-significant), but clearly this will not always 
be true. 


4, Strip Plots and Sub-subplots. 


The expected mean squares are different for strip-plot experiments 
than for ordinary split-plot experiments, but the same tests are ap- 
propriate (see, for example [4], [6]), and the same preliminary test for 
pooling can be used. The theory can also be extended in an obvious 
way to split-split-plot experiments. If it cannot be assumed that the 
whole-plot treatment A, the subplot treatment B, and the sub-subplot 
treatment C do not interact with replications, the sub-subplot error 
should be partitioned into four partsC X R, AXCXR,BXCXR 
and A x B X C X R. In the most common case (A, B and C fixed, 
and R random), these are appropriate for testing C, A X C, BX C 
and A X B X C respectively. They should be pooled only if this is 
indicated by the results of appropriate preliminary tests of significance. 
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156 NOTE: Confidence Intervals for Recombination Experiments 
with Microorganisms 


A. W. KimBatu} 
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1. METHODOLOGY 


In a typical recombination experiment, a parent bearing a bio- 
chemical mutation at the A locus is mated with a parent bearing a 
biochemical mutation at the B locus, both loci on the same chromosome. 
The two parental types are thus (a+) and (+0). The loci (or markers) 
are usually identified: with nutrient elements such as adenine or lysine, 
and a mutation at one of these loci signifies that the organism carrying 
the mutation will not grow unless the corresponding nutrient element 
is present in the growth medium. Progeny from such matings are of 
four types. If no recombination takes place on the chromosome between 
the loci, progeny must be of the parental types, and the two classes 
have equal expected frequencies. If a single recombination takes place 
in this region, the progeny will be (++) or (ab) and these types like- 
wise have equal expected frequencies. Types (++) and (ab) are called 
recombinants and types (a+) and (+b) are called non-recombinants. 
Clearly the frequency of occurrence of recombinants is dependent on 
the distance between the loci, assuming that recombination is equally | 
likely to occur anywhere along the chromosome, and obtaining informa- 
tion about distances between loci is one of the main purposes of a 
recombination experiment. The proportion of recombinants is called 
the recombination frequency and the proportion of the (++) type 
is called the prototroph frequency. 

Since, ordinarily, the four types of progeny cannot be separately 
identified, platings are made both on complete media (containing ele- 
ments A and B) and on minimal media (A and B omitted). All four 


1Present address, Department of Biostatistics, The Johns Hopkins University. 
2Operated by Union Carbide Corporation for the U. 8. Atomic Energy Commission. 
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types will grow on complete media but only the prototroph (++) 
will grow on minimal media. Thus, if a count of x organisms is found 
by plating v, ml of a suspension on a minimal medium and a count of 
y organisms by plating v, ml on a complete medium, the prototroph 
frequency is estimated by 


KR = v,2/0,y. 


If the total count, S = x + y, is large and neither x nor y is too small, 
a confidence interval can be computed from the formulas, 


fy, =0(T — Aj/,{l+ A), Ry =»(T+ A)l— A) (2) 


where R, and Ry , respectively, are the lower and upper confidence 
limits, T = z/y, A = t,(T/S)’, and where ¢, is the normal deviate 
corresponding to a 100(1 — a) percent confidence interval. 

As a numerical example, consider an experiment that yielded the 
values v, = 180, = 561, v, = 2.5, y = 3759 for which 95 percent 
confidence limits (¢.5; = 1.96) are required. We have 


(2.5) (561) 
~ (180)(3759) 


T = 561/3759 = .14924, S = 561 + 3759 = 4320, 
1.96(.14924/4320)’”” = .01152, 


(2.5)(.14924 — .01152) 
180(1 + .01152) 


2.5(.14924 + .01152) 
18001 — .01152) 


Rk 


002073 


oe 
I 


R, = = .001891, 


Ry = = 002259. 
These limits are not, in general, symmetrical about the estimate. 

In many cases, the experimenter will have preliminary information 
about the expected orders of magnitude of the ratios x/v, and y/v, 
and may want to know how large an experiment is required to yield 
a reasonably precise estimate of the prototroph frequency. One measure 
of the precision is the relative width of the confidence interval, 


W = (Ro — R1)/R. 


Ordinarily, one is interested in relative widths of not more than 30 
percent (W. < .30), and in this case the relation, 


S=460 4+ 7)/TW’, (2) 


holds approximately. Both the total count S and the ratio of the 
counts 7’, are important in determining the precision of the estimate, 
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and it can be shown that if the counts are in al : 1 ratio, the total 
count required for a specified relative interval width will be a minimum. 
With this in mind, if we put 7 = 1 in (2), the total count required 
is given by 


S = 16¢,/W’. (3) 


Thus the procedure is to decide on the confidence level and interval 
width desired and to calculate the total count from (3). Then, knowing 
that the counts should be in a 1 : 1 ratio and having preliminary in- 
formation about z/v, and y/v, , the amounts to be plated out can be 
calculated directly. If the amounts calculated turn out to be ex- 
perimentally unfeasible, one might have to depart from a 1 : 1 ratio 
for the counts, in which case, the total count can be calculated from (2) 
for whatever value of 7 is found to be acceptable. Some trial and 
error with the use of (2) may be necessary in these cases. 

As an example, consider a situation in which it 1s expected that 
the counts from a standard suspension will be about 2 per ml for x 
and 5,000 per ml for y. A relative width of about 20 percent is desired 
for a 95 percent confidence interval. Thus t, = 1.96, W = .20 and 


S = 16(1.96)’/(.20)” = 1540, 


whereupon the amounts to be plated should be chosen so that approxi- 
mately 770 counts will be obtained from the platings on each medium. 
In other words, v, = 770/2 = 385 and v, = 770/5,000 = 0.15 are 
the amounts to be plated on minimal and complete media, respectively. 

Although sample sizes determined by this method may be used as 
a rough guide in designing recombination experiments, they should 
not be regarded as optimum. Readers interested in statistically more 
sophisticated procedures for determining sample sizes should refer to 
Birnbaum [1954]. 


2. DERIVATION 


Since v, and v, are known constants, the statistical problem consists 
of finding a confidence interval for the parameter @ = u/v, where 
uw = E(x) and y = E(y). Assuming that x and y are independently 
distributed Poisson variables, Birnbaum [1954] showed that a confidence 
interval for 6 may be constructed by treating p = y/(x + y) as a 
binomial variable with E(p |x + y) = 7 = v/u + v. Limits on 7, 
say (mo , 7), are easily obtainable from published tables of the binomial 
distribution, whereupon limits on @ are given by 


6; = Clin) = ie andi ey =n eae 
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In many cases, however, « + y is large and the normal approximation 
to the binomial distribution may be used to calculate the limits on 7, 
providing 7 is not near zero or one. This in turn leads to the limits 
given in (1). 

The relative width of the confidence interval for the prototroph 
frequency can be written as 


W=W,1+4+ P/Tl — (W2/4)] 


where W, = (a, — m)/p, the relative interval width for 7. In most 
practical situations W, will be small, say < .30, in which case W & 
W,{1 + (1/T)]. When the substitution is made for W, , the result 
can be expressed as in (2). For fixed W, this expression has a minimum 
at T = 1, and (8) follows directly. I am indebted to the referee for 
pointing out that if W < .30, W, will be about half as large as W 
when 7 = 1, and the approximation will be even better than indicated. 


REFERENCE 


Birnbaum, A. [1954]. Statistical methods for Poisson processes and exponential 
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INTRODUCTION 


An important problem of estimation arises, chiefly in genetical con- 
texts, when the proportion of individuals bearing some particular 
character must be estimated from the frequencies observed in a random 
sample of groups or families which have at least one individual bearing 
this character. The estimation by maximum likelihood of the proportion 
p in this situation has been discussed principally by Haldane [5], [6], 
Fisher [3] and Finney [2]. 

In conjunction with D. G. Catcheside’s studies on tetraploid maize [1] 
the author derived an iterative procedure, which is presented here, with 
a simpler mathematical form and rather better convergence properties 


154 BIOMETRICS, MARCH 1961 


than the standard method based on score and information, which is 
described by Finney [2].’ 

It was found preferable to estimate, instead of p, a parameter M 
related to p by the equation X/M = p, where X is the total number of 
character-bearers in the sample. The parameter M may be thought 
of as the total number of individuals that might have been observed 
if families with no character-bearers had also been ascertained in the 
sample. In this respect the method described here is a special case of 
the general method given by Hartley [7], in which the application of 
maximum likelihood to data with missing (or coalesced) classes 1S 
facilitated by estimation of the relevant frequencies for those classes. 
A similar approach has also been used by Fyfe [4]. 

The formula given below, however, leads to less computation than 
Hartley’s procedure, when several different sizes of family are involved. 
In the present instance accelerated convergence has been obtained by 
applying Newton’s method of approach by tangents to the solution 
of the equation M = M*(M), and requires, for each cycle of computa- 
tion, only a simple iteration in this equation. Hartley, on the other 
hand, employs the “approach to the geometric limit”, which differs 
from Newton’s method in that approximating chords are used in place 
of tangents, and thus requires ¢wo basic iterations in each cycle. 


THE ITERATION FORMULA 


The basic sampling distribution is the zero-truncated binomial. 
Accordingly, the likelihood function for the data, ysing the notation 
given in Table 1, is essentially 


L = X logp + (N — X) logg — Df, log A — 4’). (1) 
Ii M = X/p, m* = n,/(1 — q°), and M* = > mé,, the differential 


coefficient of the likelihood function with respect to M, may be ex- 
pressed in the form 


o. = 2 cy — M*). (2) 


1In Finney’s paper it is suggested, with reference to the tables of scoring functions provided, that 
linear interpolation may be used to obtain more accurate approximations to the maximum likelihood 
value ». As a matter of principle it may be noted that linear interpolation is insufficient for this 
purpose. Linear interpolation, between the entries for pi and pz say, will determine an estimate 
intermediate to the two estimates based on 1 and pe respectively, but generally the latter estimates 
will both lie on the same side of the maximum likelihood value #, even when 71 and p2 are themselves 


on opposite sides of . More accurate interpolation, based on second differences as well, is therefore 
necessary. 
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The factor p/Mq is essentially non-zero. Application of Newton’s 
method, in which the successive approximations to the zero of f(z) 
are given by the formula 


Si Sel) ge BS (f(a) /f’ (xx)], 


to the factor (IJ — M*) gives the formula 


My. = (MF —7.M,)/(1 — Tx); (3) 
where 
; p m*(m* — n,) 
A ee P 
TABLE 1 
Novation 
Number of Number of 
Size of group, s Frequency individuals character-bearers 

2 fe ny = 2fe Hip) 
33 fs i 3fs v3 
s ie Ns = She abs 
Total F N xX 


Estimation is completed by determining # as the proportion of X 
to M, and the amount of information in relation to p may then be 
determined as 


~My _ 
1 ae py (1 Ds (4) 


evaluated at the point of maximum likelihood. The amount of infor- 
mation differs from that for simple binomial data by the factor (1 — r). 
The factor r thus emerges as the relative amount of information lost by 
truncation in the sample. This factor governs the rate of convergence 
of the iterative process, convergence being extremely rapid when r is 
small. Even when the loss of information is as high as 40 percent, 
however, as in the example below, a single cycle of computation, from 
a suitable starting point, will give a high degree of accuracy. 
A very good starting point is provided by the (iterative) solution 
of the much simpler equation 
M =N/(1 — ¢), (5) 
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where § is the mean size of families observed, or the equivalent equation 
for the Poisson distribution (if p is small), as used by Fyfe [4]. Indeed 
for many practical purposes the solution of (5) will itself be sufficiently 
accurate. In the example given below, in spite of the wide range of 
family sizes, the solution of (5), roughly M = 3,300, is within 3 percent 
of the correct maximum likelihood value, and, in the case of the data 
on human albinism discussed by Haldane [6], the relative error is 4 
percent. 


EXAMPLE 


Catcheside [1] gives the numbers of recessive sw seeds observed on 
24 cobs from tetraploid maize triplex for su—1* and pollinated by 
nulliplex. It was possible that the parent plants of one or more of the 
seven cobs observed with no su seeds might have been quadruplex 


TABLE 2 
ITERATIVE SOLUTION OF THE Equation M = M*(M) 
| 

S, = Ne x q Mio hi my 
18 if . 7935 87.2 83749 110.76 
65 2 .4338 114.8 52707 137.44 
74 1 . 3864 120.6 48235 142.95 
76 1 .3766 121.9 47294 144.20 
84 2 .38399 127-2 43709 149.22 
86 2 SSO 128.6 42856 150.50 
131 2 .1858 160.9 27508 180.71 
140 3 . 1655 167.8 25174 187.10 
7} 2 1083 194.0 18187 211.46 
181 2 0977 200.6 16808 217.57 
188 2 .0893 206.4 15688 222.98 
206 1 .0709 22107 131389 23716 
211 1 .0665 226.0 12507 241.16 
219 2 .0600 233 .0 11559 247 .62 
223 4 .0570 236.5 .11112 250.88 
251 3 .0398 261.4 .08433 274 12 
259 2 .0359 268.6 07794 280.89 

INS 3 PASS 2 = Bp} M* = 3077.2 M* = 3386.72 
S => m* (m* — ns)/fs So = 73,842 Seal 22276 
(all f, = 1, F = 17) ro = 0.3694 ri, = 0.41850 
1 — ro = 0.6306 1 — mn = 0.58150 
Mo = 2585 M, = 3,366 M, = 3,401.63 
po = 0.012766 pi = 0.009804 po. = 0.009701 


QUERIES AND NOTES 157 


rather than triplex. As a check, therefore, p was estimated from the 
17 cobs with one or more su seeds. The iterative calculations for these 
data are set out in Table 2, and give an estimate 17 = 3402 in very close 
agreement with the actual number of seeds observed, 3359, on all 24 cobs. 
The sensitivity of the data on this point is rather low, however, as the 
relative standard error of M (and #), determined by formula (4), is 
23 percent. 

To exhibit the convergence properties, and for comparative purposes, 
M, = N = 2585 has been chosen as the starting point for the com- 
putations in Table 2. In practice the value 3359 might have been 
chosen, or, lacking this information, the approximate solution 3300 of 
the simpler equation (5). A single iteration would then have given far 
more accuracy than actually required. The values of qg* were determined 
in this instance by logarithmic calculation, but this calculation can be 
avoided if suitable binomial tables are available. The values r, are 
determined essentially by the computation of a weighted sum of pro- 
ducts of the m* with the differences m* — n, (not shown). 

To compare rates of convergence the successive approximations to 
the solution of dL/dp = (M — M*)/gq, by adjustment in terms of score 
and information, were also determined. The comparison is shown in 
Table 3. It can be seen that the present method does very much better 
from a poor start than the other, requiring only one iteration instead 
of two. However the disparity diminishes, as one would expect, in 
the neighbourhood of the maximum likelihood value. 


TABLE 3 
Successive APPROXIMATIONS TO M = 3401.76 By APPLICATION OF NEWTON’S 
Mernop To THE Equations (i) M = M*(M), (ii) dL/dp = 0. 
RELATIVE ERRORS ARE SHOWN IN PARENTHESES 


My M, M, M; 


M = M* 2585 | 3366(1%) | 3401.63(0.004%) | 3401.76 (7) 
dL/dp =0 | 2585 | 3697(10%) | 3421.19 (0.6%) 3402.04 (0.008%) 


COMMENT 


The approach which led to the present method was somewhat dif- 
ferent from Hartley’s and may be of interest in other connections. In 
the simplest cases the differential of the likelihood function with respect 
to a parameter (or some related parameter), @ say, can be factorized 
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into two parts, one linear in 6 and the other essentially non-zero. More 
generally, 
dL /dé = g(l(0, x)).h(O, x) 


where 1(6, x) is linear in 6, g(0) = 0, and (8, x) is essentially nonzero. 
In such cases one has the explicit solution given by the zero of 1(0, x) 
and no iteration is necessary. It was argued that, in the less simple 
cases, a similar factorization might be sought, in which 1(6, x), though 
not. strictly linear, was in some sense asymptotically linear. In the 
present instance, for example, the function J — M* is asymptotically 
linear in M as € = X/F increases. A sketch graph of this function is 
shown in Figure 1. Curvature of the function is restricted by a tangent 


FIGURE 1 
Sxetcu Grapy or THE Function M — M*(M) 


(The intercept X(N — F)/(X — F) corresponds to Haldane’s estimate of p, [6], from data in which 
the probability of ascertaining a family is proportional to the number of affected individuals in the 
family.) 


and an asymptote which differ in slope only by the factor 1/@. Generally, 
it could be expected that iterative methods applied to 1(6, x) rather 
than to dL/d@ would give more rapid convergence, as the functions 
g and h in the latter will generally have an adverse effect on convergence 
by increasing curvature. The comparison is complicated, however, by 
the presence of one or more inflexion points in dL/dé. 
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7 GUMBEL, E. J. Statistics of Extremes. New York: Columbia University 
Press, 1958. Pp. xx + 375. $15.00. 


B. F. Kiwparu, New York Public Service Commission, New York, N. Y., U.S.A. 


This work is essentially a source book on the statistics of extreme values. 
Furthermore it is the only source which is comprehensive. Viewed as a source book 
it is well arranged and well documented. The bibliography includes over 600 books 
and articles, to which specific references are made throughout the book. Of these, 
34 are papers by the author, or written in collaboration. 

The first chapter deals with definitions and the basic concepts. In the second 
chapter, on order statistics, the theory of exceedances is set forth and handled in a 
straight-forward and concise manner. 

In Chapter 3 the exact distribution of the extreme values of a sample are intro- 
duced. Chapter 4 is given over to an analysis of the exact distributions of extremes, 
pointing out the essentially different types which are important. 

These first four chapters comprise nearly half the subject matter and include 
all the material dealing with “‘exact’’ distributions of extreme sample values. Per- 
haps the principal advances in the study of the distribution of extremes in the last 
twenty five years have been brought about by recognition that the asymptotic 
forms of these distributions are usually adequate as approximations to the exact 
distributions in the applied field, and are much simpler to deal with. 

The asymptotic forms fall naturally into three classes, of which the doubly 
exponential is of the most consequence since it is the asymptotic form which results 
from a universe having an “exponential” type of cumulative distribution function, 
such as the logistic, normal and Gamma distributions. A second class, or “second 
asymptote” as referred to by Gumbel, stems from distributions of the Cauchy or 
Pareto type. The third class is often referred to as Weibull’s distribution when 
used for the lower extreme. Both can be obtained from the first by suitable loga- 
rithmic transformations. The first ‘‘asymptote” and its applications are discussed 
in Chapters 5 and 6, covering about 100 pages. This class is of particular interest 
to biologists—for example, in studying extinction times of bacterial. The second 
and third asymptotes are treated in Chapter 7. Chapter 8 is devoted to the asymp- 
totic forms of the range and mid-range, and their applications. 


'Statistical Theory of Extreme Values and Some Practical Applications, U. 8. Dept. of Commerce 
N. B. 8, Applied Math. Series 33(1954) p. 43; U. S. Gov. Printing Office, Washington, D. C. 40c. 
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This reviewer finds the author very ingenious in deriving methods which he 
believes will facilitate application of the theory to practical problems. Some of 
these ingenious devices are open to criticism. For example, the discussion of the 
topic Fitting Straight Lines on Probability Papers (pp. 34-86) is more ingenious 
than logical. 

A formula is developed at the bottom of page 23 for the probability that a 
return period 7 will be reached in » trials. This probability is “distribution free” 
and for large 7 can be approximated by W(v) = 1 — exp (—v/T). From this 
Gumbel develops what he calls a “control interval’. 

T/X <v < TX, dX > 0, the probability that v will lie on this interval being given 
by P =e"'’* —e-*. This is ingenious, but, since the probability density function 
is monotonically decreasing from v = 0, the shortest interval carrying the same 
probability would be 0 < » < TN with P = 1 —e-*’. Hence, for P = .6827,’ = 
1.148 and the ‘“‘control interval” would be 0 < v < 1.1487 rather than (.3196)T < 
Di <a (dole) 1 

A similar criticism applies to the control intervals (5) on page 215. These 
intervals are centered at the mode. The ‘‘shortest”’ such interval for P = .6827 
would be given by A = —.917, A = +1.317, rather than by A = +1.141. This 
would change (8) to T(y + A) = 3.7327T(y) — 1.366, T(y — A) = 0.4007(y) + 0.300. 
Likewise for P = .9545, A’ = +3.0069 becomes A’ = —1.59 and 3.251. Thus, 
when the underlying distribution is asymmetrical, some adjustment should be 
made when possible. 

In presenting The Extreme Control Band of Table 6.1.6, p. 218, and in other 
references to the control band, besides noting the lopsided character of the ‘‘shortest 
interval” of fixed probability, the author might have pointed out that most of 
these formulae presume a correct estimate of the scale parameter a. As I pointed 
out in my contribution to the book, the uncertainty of the estimate of a will involve 
the variance as a function of y, the distance from the mode, and N, the size of the 
sample used in determining a [cf. (2) p. 235]. Thus the complete variance upon 
which a confidence band should be based will involve both the ‘‘variance of position”’ 
of the fitted line and the intrinsic variance of the order statistic (used as a basis 
for Table 6.1.6.)?. 

Other formulas for the ‘‘variance of position” are casually referred to, such 
as the “Control Curve of Dick and Darwin’, Section 6.1.7; and formula (5), p. 
228, which gives the sampling variance of an estimate x obtained by the method 
of moments. In using this latter formula it should be made clear that the o? in 
the denominator refers to the variance of the sampled values and hence might be 
replaced by s?. The formula (6) which follows it would appear to be in error. With 
(2?/6)/c? substituted for o? and o%(y) = a’o(x) the a should cancel out. Also 
the significance of (6) is not clear since y is usually assigned a value in making an 
estimate from (4) by the method of moments. 

The matter of the identification of the variance of the order statistic vis @ vis 
that of the sampling variance of the estimate (variance of position) needs to be 
made clearer. 

I have noticed only two rather obvious typographical errors: On p. 24, line 6, 
.4637 should be replaced by .04657, and on p. 215, line 1, .05450 should be replaced 
by .95450. 


2Henry Schultz, The standard error of a forecast from a curve. J. Amer. Stat. Asso. 25 (1930), 
139-85. 
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VESSEREAU, A. Méthodes Statistiques en Biologie et en Agronomie, Tome 
8 deuxiéme. Paris: J.-B. Bailliére et Fils, Editeurs. Nouvelle edition entiérement 
refondue, 1960. Pp. 540. 55 NF. 


P. Rosrnson, Canada Department of Agriculture, Ottawa, Ontario, Canada. 


This book, as the title implies, is intended principally for research workers in 
the biological sciences. There are now several good texts available in English, 
but there are few in the French language. In his introduction to the first edition 
the author says “Le pfesent ouvrage se propose de combler cette lacune.” That 
there was this need may perhaps be judged by the short time elapsing between 
the first and second editions. The second edition has been completely revised and 
now includes— 


(i) a fuller discussion of the underlying mathematical reasoning, 
(ii) a more general description of the analysis of variance. 
(iii) a more extensive treatment of experimental design, including confounding, 
and 
(iv) tables and graphs relating to the binomial and Poisson distributions. 


This book contains a great deal of useful information and the mathematical 
treatment is kept simple throughout. Basic ideas on probability and sampling 
are discussed in the first four chapters, simple tests of significance in the next two 
chapters, experimental design in the next nine, and regression and correlation in 
the last two. 

In covering such a wide field within the confines of a single book, any treatment 
in depth is of course precluded. The broad approach to the subject may have led 
the author to include descriptions of higher order moments, the 6: and 62 measures 
of skewness and kurtosis, and Sheppard’s corrections. These, however, are com- 
pletely unnecessary in an introductory text of this nature intended for the biologist, 
as is evidenced by the fact that they do not reappear after Chapter II. It is also a 
pity that outdated measures of dispersion—the probable error (pp. 56 and 100) 
and the mean error (p. 100)—are discussed. These are interesting from a historical 
viewpoint, but only serve to confuse the agricultural worker. 

It would have been better to sacrifice breadth of approach in these areas in 
order to deal a little more fully with problems of more direct concern to the biologist. 
In particular the sections on transformations, split plots, and the combined analyses 
of repeated experiments could have been expanded. A discussion on expected mean 
squares might have helped when dealing with this last problem; the correct choice 
of the appropriate error for testing a particular hypothesis would then have been 
clearer, instead of appearing to be left to the whim of the experimenter (p. 419). 

A fuller discussion of the difficulties involved in multiple comparisons would 
also have been helpful. Comments (2) and (3) on pp. 187 and 188, for example, 
appear to be contradictory, and the discussions on pp. 203, 204, 206 and 216 could 
be misleading. 

The use of artificial data in certain illustrations (pp. 193, 458) is unwarranted 
when so many examples are now available. This is particularly dangerous when 
dealing with the analysis of covariance because the interpretation can be tricky. 
The author’s suggestion of circumstances in which a covariance analysis could be 
used—p. 451—“Lorsqu’ on étudie l’influence des engrais sur l’indice de pureté du 
sucre extrait d’une culture de betterave sucriére, les résultats dépendent du poids 
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des racines, qui n’est pas constant dans les différentes parcelles de l’essai, mais qui 
peut étre mesuré aprés coup’’—is a particular case in point. 

Finally I must disagree with a piece of advice which the author gives in his 
“En guise de conclusion.” He suggests, if there is any doubt in the mind of the 
research worker as to the applicability of some of the techniques discussed, that 
“mieux vant encore s’en tenir aux schémas éprouvés.’’ This value of this advice 
depends entirely upon what is meant by “‘schémas éprouvés’’. In general, it would 
be far better to advise the research worker ‘‘de demander conseil aupres des experts 
en statistiques.” 


LOVE, A. G., HAMILTON, E. L. and HELLMAN, I. L. Tabulating Equipment 
9 and Army Medical Statistics, Washington, D. C.: Office of the Surgeon-General, 
U.S. Army Medical Service, 1958. Pp. x + 202. 36 Figures. $2.00. 


W.W. Houianpn, London School of Hygiene and Tropical Medicine, London, 
England 


This is a delightfully written account of the development of machine methods 
and of the techniques of recording in the American Army. The book not only 
traces the development of statistical services, but also shows how great the de- 
pendence of an efficient military medical service is on the collection, tabulation and 
assessment of data. After reading this short monograph it is easy to understand 
how the excellent description of such diseases as cholera, respiratory infection and 
of injuries sustained by American Army personnel during the two World Wars 
could be completed. It is of interest to read that, even in the first years of the 
existence of the American Army, during the Napoleonic Wars, acute respiratory 
disease was a severe problem. Although the treatment tends, at times, to be some- 
what too biographical, a good account of the development of punch cards, sorting 
and tabulating systems and of computers is given. Full details and illustrations 
of the various forms used for recording data are described as well as an account of 
the collection and analysis of anthropometric information obtained during routine 
assessment of personnel on enlistment and discharge. 


ABSTRACTS 


The following are abstracts of papers presented at the annual meeting of the Biometric 
Society EN AR, held jointly with the Biometric Society WN AR, the American Statistical 
Association, and the Institute of Mathematical Statistics in Stanford, California on 
August 23 to 26, 1960. Abstracts of additional papers presented at this meeting appear 

in other society publications. 


KLAUS ABT (U. 8. Naval Weapons Laboratory, Dahlgren, Va.). Analysis 


Ss of Variance of Differences Versus Analysis of Covariance. 


In analyzing paired data (x; y), classified in one or more directions, where 
x and y are observations of the same variable (and therefore usually observed at 
different times) at least two methods are available. First there is the analysis of 
covariance in which y is considered to be stochastically dependent on z, the “‘in- 
dependent” variable. A second method is the analysis of variance of the generalized 
differences, d = y — Bot, briefly called “analysis of differences”. The case Bo = 1 
represents the most important case of ordinary differences. 

It is shown that, if the condition E(bg) = Bo is satisfied (“expectation of the 
coefficient of error. sum of squares regression line equals 60’, which easily can be 
tested), the analysis of differences has a power equal to or even higher than that 
of the analysis of covariance. From many calculated examples, it seems that the 
condition mentioned is satisfied quite often. In many cases, therefore, the analysis 
of differences will be the appropriate method for the data to be analyzed, besides 
this including a minimum of computation. 


684 R. L. ANDERSON (North Carolina State College, Raleigh, N. C.). Recent 
Developments in Multivariate Analysis. 


One of the mest important recent developments in multivariate analysis has 
been the publication of some excellent textbooks on the subject, on both theoretical 
and applied aspects. This paper is in the nature of a discussion of what the writer 
feels is needed to improve multivariate techniques. The chief need is much more 
attention being paid to construction of experimental models, usually involving two 
or more equations, and methods of estimating parameters in these models, assessing 
their usefulness and testing hypotheses concerning them. Particular attention is 
paid to causal chains and to dynamic systems. For the latter, methods based on 
lagged dependency models and Laplace transform techniques are indicated. 

Some possible weaknesses of current multivariate analysis of variance procedures 
are indicated. Problems which should be studied in the following fields are outlined: 
discriminatory, canonical, factor and component analysis. Special attention is 
focused on the construction of selection indexes, and the selection of predictors in 
ordinary multiple regression analyses. 
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685 ROLF E. BARGMANN (Virginia Polytechnic Institute, Blacksburg, Va.). 
On the Problem of Ordering Variables in Tracing Significant Contributions. 


After rejection of some general linear hypothesis in multivariate analysis it is 
frequently desirable to ascertain how much each variable contributes to the ob- 
served differences. The step-down procedure—i.e., a process analogous to analysis 
of covariance in which we study the first variable, the second given the first, the 
third given the first and second, etc.—depends on the ordering of the variables. 
The present report discusses the use for this purpose of the correlations of a linear 
discriminant function a’X, where a’ is the eigenvector associated with the largest 
root of HE (H = matrix of 8.8. and S.P. due to hypothesis, E same due to error). 
A similar procedure is described for the discrimination of common factors, i.e., for 
those cases where any difference between groups is supposed to be in terms of those 
artificial components only which are shared by two or more of the observable vari- 
ables. A detailed demonstration study will be presented for illustration. 


EDWARD C. BRYANT and DONALD W. KING (University of Wyoming, 
686 Laramie, Wy.). Estimation from Populations Identified by Overlapping 
Sampling Frames. 


Consider a population of unknown size identified by p sampling lists, each with 
M; entires from which independent samples of size m; , 7 = 1, 2, --- , p may be 
drawn. It is assumed that the sampling procedure will reveal which of the p lists 
contain the individuals interviewed. An estimate of the total number in the popula- 
tion is desired, as well as the number of individuals appearing on each possible 
combination of sampling lists. Maximum likelihood and minimum modified chi 
square estimates are considered. By moderate restrictions, an approximation to 
the variance 1s provided by use of the Taylor’s series expansion around the expected 
numbers in each category of listing combinations. 


BRADLEY E. COPELAND (New England Deaconess Hospital, Boston, 
Mass.). Problem in Pathology With Statistical Aspects. 


The pathologist in his role in diagnostic laboratory medicine finds a compelling 
need for a broader comprehension of the fundamental principles of statistics for 
himself and on the part of the practicing physician, the medical technologist, and 
the medical research worker. 

Every physician must be equipped with certain statistical tools to properly 
utilize the laboratory measurements now available to him for diagnosis and treat- 
ment. 

There must be clear and rapid communication between the pathologist and the 
clinician concerning the variation in normal values and the reliability and inherent 
variability of laboratory measurements. 

The first major problem relates to the barrier of statistical nomenclature and the 
multiplicity of statistical formulas. Communication cannot be successful until 
pathologists and clinicians learn to handle with ease such fundamental statistical 
concepts as standard deviation, standard error, variance, the ¢ test, and the F’ ratio. 

Since medicine is concerned with the individual, there is a need to clearly 
identify the tools which separate the diseased individual from a normal population 
as opposed to the tools which differentiate normal and abnormal populations. 
Both are useful devices, but they are the source of much confusion in the minds 


of physicians. 
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Currently there is an also pressing need for more comprehensive definitions of 
normal values in all laboratory measurements—normal as regards variation in age, 
sex, geographical location, and any other environmental or hereditary factor which 
may influence the normal population. 

To achieve permanent indoctrination of the medical profession with useful 
statistical tools is a necessary and useful goal. It is in this area that active communi- 
cation between pathologists and statisticians should take place to develop solutions 
to the problems described. 


R. C. ELSTON (University of North Carolina, Chapel Hill, N. C.). On 


688 : 
Additivity in the Analysis of Variance. 


The problems of testing for non-additivity and of finding the best transformation 
of the data to minimize it are discussed. Consider the model: E(yq) = « + Ma + cua, 
where y, is an observation belonging to the g-th subclass in an analysis of variance 
problem. If this model fits our data, and if max nq — min pg < 1/| c | (the range 
over which this model is a strictly monotonic function of yu), then there exists a 
useful transformation of our data that will give additivity; that is the transformed 
data will be fitted by an additive model. Two tests that have already been pro- 
posed for testing for non-additivity are tests of the null hypothesis c = 0 against 
c ~ 0 under models that can be considered as approximations to the above model. 
It is shown how estimates of » and c can be used as guides in finding an appropriate 
transformation of the data. 


WALTER T. FEDERER (Cornell University, Ithaca, N. Y.). Augmented 


689 
Designs with Two-, Three- and Higher-Way Elimination of Heterogeneity. 


Augmented experimental designs are standard ones to which additional treat- 
ments have been added. The additional treatments may or may not be replicated 
tne same number of times as the treatments in the standard design or the same 
number of times as other new treatments. The groupings of treatments into rows, 
columns, complete blocks, etc. may be such that the size of the group is or is not 
equal. Construction procedures, randomization procedures and the analyses in 
general form have been presented for augmented experimental designs with two-, 
three- and higher-way elimination of heterogeneity. 


69 WILLIAM R. GAFFEY (California State Department of Public Health). 
Tests of Hypotheses Concerning Boundedness in Convolutions. 


Suppose the random variable Y, with unknown d.f. G(y), is observed subject 
to error, so that X = Y + Z is the random variable actually observed. Let H,(z) 
be the (known) conditional d.f. of the continuous “error’’ random variable Z, and 
let F(X) be the df. of X. We are concerned with deriving tests of the following 
two-hypotheses about the distribution of the ‘‘true” random variable Y ; (1) G(y) = 0 
for y <a, and (2) G(y) = lfory > 6. 

Under suitable restrictions on the form of H,(z), it turns out that hypothesis 
(1) is true if and only if F(X) < H.(a — a) for  < a, and hypothesis (2) is true 
if and only if F(X) > H,(« — b)forz > b. A test of either inequality is therefore 
a test of the corresponding hypothesis about G(y). 

A test is proposed using the one sided Kolmogorov-Smirnov statistic, restricted 
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to the appropriate portion of the range of X. The large sample distribution of the 
statistic is derived, and the consistency of the test is established. 


MAX HALPERIN (General Electric Co., Schenectady, N. Y.). Nearly 
Least Squares Estimates in Heteroscedastic Regression. 


Suppose we have observations y;;, 7 = 1, ---, k, jf = 1, t+, m 
where Hy;; = a + ba; and Var y;; = oj , the x; are presumed known, and the y;; 
are normally distributed, mutually independent, and k > 2, min n; > k. In this 
paper we show how to generate estimates of a and b which are nearly least squares 
in the sense that the correlation of the estimates is identical to that which would 
be obtained from a least squares fit with known variances while the variance of the 
estimates differs from that appropriate to the known variance least squares fit by 
a multiplicative factor which approaches unity as min; n; > © and k remains con- 
stant. The procedure further allows estimation of the covariance matrix of the 
estimates with (min; n; — k + 1) degrees of freedom and hence an exact confidence 
region for a and 6 based on Hotelling’s J? distribution. The results cited are of a 
conditional nature in which the conditioning constrains the variance multiplying 
factor referred to above. The distribution of the multiplicative factor aside from 
known constants is that of Hotelling’s central T? in general (Student’s ‘‘¢” if k = 3). 
As a consequence, alternative exact confidence intervals of unconditioned nature 
are available in principle by averaging the conditional results over the appropriate 
distribution. It is conjectured that matching moments to determine approximate 
degrees of freedom for an “equivalent” 7? distribution is adequate from a practical 
point of view. If k = 2, the procedure we propose gives estimates with exactly 
least squares variances and exact confidence intervals or regions. It is conjectured 
that results similar to the above generalize to general polynomial or polyvariable 
regression linear in the parameters and a sketch of the type of argument is given. 


R. C. HENNEMUTH (Inter-American Tropical Tuna Commission). Esti- 


692 
mating Vital Statistics of Yellowfin Tuna Populations. 


Knowledge of vital statistics is essential to the study of the dynamics of ex- 
ploited fish populations; age composition, growth rate and mortality rates being 
particularly important. 

Determination of age of yellowfin tuna by using marks on scales or bones has 
not proven reliable; however, growth rate and relative age have been estimated 
by observing the temporal changes in the size composition of catch. Absolute age 
has been approximately determined by comparing the average time of spawning 
with the time and size at which the age groups initially appear in the catch. 

Total mortality rate has been estimated by analyzing catch-curves of successive 
year classes, which have been computed from age composition and indices of apparent 
abundance based on catch per unit effort data. 


693 BURTON J. HOYLE and GEORGE A. BAKER (University of California, 
Davis, Cal.). Game Theory Applied to Field Trials. 

The determination of the proper variety of a cereal to grow may be considered 

as a game with the Grower as one opponent and Nature as the other. A game 

matrix of forty-one independent yield trials on nine strains of Hannchen barley is 

presented. The trials cover three years and very diverse designs and environments 
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within the years. A similar but more restricted matrix for kernel weight is also 
presented. The whole structure of field trials is made clear by a contemplation of 
these game matrices. Sometimes one strain does better and sometimes another. 
That is, it is doubtful that there is a best strain. Sometimes all strains are close 
together and sometimes the strains differ widely. Some strains are very erratic, 
in relative yield, and some are very consistent. Strains highest with respect to 
yield are not necessarily good with respect to kernel weight and other desirable 
properties so over-all ratings involve weighting considerations on various character- 
istics. 

It is clear from dominance considerations that some strains should seldom or 
never be grown. In other cases mixtures of strains may be better than any one 
strain. 

A determination of the odds for the different strains depends on the distribution 
of nature’s strategies. is 


AUGUSTUS CG. JOHNSON (Booz-Allen Applied Research, Inc.). A Stochastic 


08 Model of Incubation-Period Distributions. 


An independent-action birth-death model of infection is presented, based on the 
hypothesis that each organism has probability \ dt of dividing and y dt of dying 
during the time interval dt. \ and » being constants. Incubation terminates when 
a colony reaches a lower absorbing barrier at o or an upper absorbing barrier at 
some number JN, at which point a sign or symptom appears. 

The resulting differential-difference system is solved to give expressions for 
the distribution of conditional probability that m organisms will grow to an upper 
absorbing barrier V between times 0 and ¢. Expressions for the moments are derived 
both from the distribution and directly, being exhibited explicitly for moments 
through the third. 


CECIL L. KALLER and VIRGIL L. ANDERSON (Purdue University, 
605 Lafayette, Ind.). An Environmental Extension of Chromosome Analysis in 
Population Genetics. 


A phase model has been developed for the special case of a diploid diallelic 
genetic population having K pairs of chromosomes in genetic equilibrium which are 
influenced by Q tri-level environmental factors. The theoretical structure for this 
development is based on the theory of factorial experimental designs with each 
of the K +- Q factors having three ‘“‘levels.’”’ Estimates of the parameters of the 
population described by this model are obtained by extending the methods presented 
in a thesis by McKean (Purdue, 1958). This sampling method involves the formula- 
tion of a “chromosome population” by the random selection of a set of K chromo- 
some pairs from the original genetic population. The chromosome pairs so drawn 
are combined to form all possible genotypes; individuals of each resulting genotype 
are placed in all 32 possible environmental level combinations in numbers propor- 
tional to their theoretical frequency. Under the assumption that all interactions 
involving three or more factors are zero, an analysis scheme is developed for the 
estimation of chromosome population parameters. Therefrom, by the relationship 
demonstrated between the chromosome population parameters and the corre- 
sponding parameters of the original genetic population, conditions are established 
under which these latter are estimable. 
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696 MARVIN A. KASTENBAUM (Oak Ridge National Laboratories, Oak Ridge, 
Tenn.). Countercurrent Dialysis—A Stochastic Process. 


Peptides isolated from rat liver microsomes may be separated from contami- 
nating amino acids, sugars and salts by countercurrent dialysis. A single such 
dialysis can achieve only fractional purification. Therefore, more complex systems 
of multiple dialysis, in which a linear series of cells are used, have been devised. 
The system of immediate interest may be described as follows: Let each dialysis 
cell be a stage, and each dialysis period be a cycle. Then after each cycle, the 
dialout (the solution outside the dialysis sac), at each stage is concentrated and 
used as the dialin (the solution inside the dialysis sac) in the succeeding stage, 
whereas the dialin at each stage is concentrated and returned to the dialysis sac of 
the previous stage. The dialin at the first stage is retained at the first stage, and 
the dialout at the final stage is taken out of the system. In the nomenclature of 
stochastic processes, the first stage represents a reflecting barrier and the last stage 
an absorbing barrier. 

To determine the probability with which a particle of the isolate will be at a 
specified stage in the system after a given number of cycles have been carried out, 
the system is described in terms of a Markov process with a stochastic matrix re- 
sulting from the product of two matrices, namely, a diffusion matrix and a transfer 
matrix, and algebraically explicit solutions are derived for any number of stages 
and cycles. 


JOSEPH KEILIN (U.S. Dept. of Health, Education, and Welfare, Washing- 
697 ton, D.C.). The Use of the Information Statistic as a Measure of Conformity 
in Comparing Two Sets of Responses. 


A morbidity survey of the city of Nashville and suburbs was recently conducted 
by the Air Pollution Medical Program. In order to obtain some measure of the 
reliability of responses a subsample was chosen for reinterview. 

Two methods of looking at the data are discussed and two functions are pro- 
posed for measuring the proportion of information transmitted from the ordinal 
interview. The two methods dictate different null hypotheses and these in turn 
lead to the two different functions. 

The results from the reinterview are presented and show relatively low amounts 
of information transmission. Further analysis of the data, using information theory 
procedures, is suggested. 


JOHN G. KEMENY (Dartmouth College, Hanover, N. H.). General Remarks 


69 Say mae 
g on Markov Chains with Illustrated Examples in Biology. 


Part I of the paper deals with finite Markov chains. It is shown how, through 
intelligent classification of various types of chains and the use of modern matrix 
techniques, many fundamental problems can be reduced to routine computations. 
Two fundamental matrices are developed, one for absorbing and one for ergodic 
chains, in terms of which basic quantities can be simply expressed. As applications 
of these mathematical techniques, problems in genetics are considered. 

Part II of the paper deals with some recent joint research carried on with 
J. L. Snell, concerning denumerable Markov chains. It is shown that extensions of 
the above-mentioned matrix techniques to denumerable chains result in an interest- 
ing discrete analogue of classical potential theory. While the discrete version is 
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simpler than the classical potential theory, it can be vastly generalized by extending 
it to a wide class of denumerable Markov chains. This has been done previously 
for transient chains, but the extension to recurrent chains is new. The fundamental 
matrices extend to basic potential operators. Applications of these results are 
sketched to population problems and to the spread of epidemics. 

Throughout the paper, stress is laid on the methodological advantages of setting 
up a Markov chain model. A number of key results are stated without proof and 
several unsolved problems are mentioned. 


OSCAR KEMPTHORNE (Iowa State University, Ames, Iowa), and R. N. 
699 CURNOW (University of Aberdeen, Aberdeen, Scotland). The Partial 


Diallel Cross. 


The diallel cross, which is composed of all possible single crosses among a group 
of inbred lines, is a common plan of investigation in plant and animal breeding. 
It can be used to estimate general and specific combining abilities and variances, 
to estimate components of genotypic variation and to estimate yielding capacities 
of multi-way crosses. With n lines it does however involve n(n — 1)/2 crosses, 
even if maternal effects can be ignored. Breeding programs can easily lead to 50 
inbred lines which merit examination, so the diallel cross becomes unfeasible. This 
paper deals with sampling of the complete diallel cross table in a specific say, namely: 


(1) arrange the lines in random order 
(2) obtain the crosses of line 7 with lines 


bebt, bat 1 hee = es 
where k = (n + 1 — s)/2 is an interger and all number above n are reduced 
by a multiple of 7 so as to be between | and n. 


Each line is crossed then with s lines and s = n — 1 corresponds to the complete 
diallel cross. This partial diallel cross is considered in the following respects: 


(1) estimation of variance components 
(2) comparing the yielding capacities of all possible crosses and 
(3) estimation of general combining abilities. 


The efficiency of the plan with regard to each of these aspects is evaluated and 
discussed. It is shown to be more efficient than some other possible plans under 
some circumstances. 


BERTRAM 8. KRAUS (University of Washington, Seattle, Wash.). The 


700 
Study of Human Growth: Some New Horizons. 


Researchers in biometrics, genetics, and physical anthropology have a fertile 
field for miscegenation in the study of growth. An example is the timely and signifi- 
cant problem of the possible genetic effects of irradiation on future human genera- 
tions. Thus far workers in this field have failed to demonstrate that radiation 
causes mutations in the human species, although this can be assumed from experi- 
ments with other mammals. Perhaps this failure can be associated with insufficient 
knowledge or recognition of the nature of growth and the role of the genetic consti- 
tution in controlling or regulating growth. Perhaps, too, there has been too much 
emphasis placed upon clinical classifications of abnormalities (congential malforma- 
tions) and not enough on the less dramatic but none the less real deviations in 
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patterns and rates of growth. Equally significant is the failure to define abnormality 
in terms of the statistical characteristics of the given population. 

Attention is focused in this paper upon the distribution of certain skeletal 
phenomena in prenatal life—sequence of appearance of centers of ossification in 
the foot, changing long bone inter-relationships, and appearance of centers of cal- 
cification in the primary dentition. Norms for human populations are estimated for 
these traits. It is then suggested that significant deviations from norms established 
in control populations may reflect higher frequencies of mutant genes in a test 
population. A method is proposed for the study of the genetic effects of irradiation 
in Hiroshima and Nagasaki wherein the biometrist, the geneticist, and the physical 
anthropologist may pool their skills and concepts. 


B. KURKJIAN (Diamond Ordnance Fuse Laboratory) and M. ZELEN 
701 (National Bureau of Standards, Washington, D. C.). A General Theory for 
Analysis of Asymmetrical, Confounded Factorial Experiments. 


A general theory is developed for the analysis of factorial experiments involving 
an arbitrary number of factors and levels of each. The treatment-combinations are 
assumed to be assigned to an appropriate incomplete block design and the treatment 
estimates, 7’s, are computed in the usual way, ignoring the factorial correspondence 
between the treatments. New results are presented for obtaining the best linear 
estimates (and the associated var-cov matrix) of the various main effect and inter- 
action parameters in terms of the 7’s. A condition on the var-cov matrix of treat- 
ment estimates, V; , is provided which insures that the sum of squares of the various 
main effect and interaction terms is distributed as chi-square. Furthermore, a 
certain large class of partially balanced incomplete block (PHIB) designs with 
arbitary number of associate classes is described which can be treated by methods 
of this paper. The theory is applied to two PHIB designs involving three associate 
classes. : 


ROBERT 8. LEDLEY (George Washington University, Washington, D. C.). 


02 
é A Sequential Decision Theory Applied to Medical Diagnosis. 


The reasoning foundations of medical diagnosis involve two well-known mathe- 
matical disciplines; symbolic logic and probability theory. Value theory then can 
aid the choice of an optimum treatment. 

Three factors are involved in the logical analysis: medical knowledge relating 
disease and symptom complexes; the symptom complex of the patient; and the 
disease complexes that are the final diagnosis. Hach of these can be expressed as a 
Boolean function. The ease of making diagnostic tests varies greatly; a realistic 
method of diagnosis uses easier tests to select among more difficult tests. 

Using probability theory, a ‘most likely’ diagnosis is determined by applying 
Bayes’s formula to the conditional probability that a disease complex will produce 
a particular symptom complex and the total probability that any person chosen 
from the sample under consideration will have that disease complex. Certain 
problems arise in computing statistics for the probability determinations; the 
difficulty in obtaining ‘sufficiently large’ data and the time delay in gathering data. 

Value concepts in medical diagnosis and treatment are concerned, for example, 
with decisions on whether to continue further testing, or what are the values of 
particular alternative treatment—diagnosis combinations. We distinguish three 
kinds of problems in treatment decision; under certainty, i.e. when the disease is 
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definitely known; under risk, i.e. when alternative diagnoses have been made with 
known probabilities; under uncertainty, i.e. when alternative diagnoses remain 
with no information concerning the probabilities. 


K. H. LU (Utah State University, Logan, Utah). The Means and Variances 


ae of the Products of Two or Three Normal Variables. 


Given normal variables ¢; , f2 and ts with means wi, ws and ws, variances o11, 
oo2 and o¢: respectively. Let ¢: and 2 follow the bivariate normal distribution with 
moment generating function: 


2 2 
M (t,t) = exp. Dy Mal; + 4 Ds Oi;Kib; 
4=1 tj7=1 


Also let t; , t2 , and ¢; follow the trivariate normal distribution with moment gener- 
ating function: 


M(tytots) = exp. Y wi +4 oo ee 


tj=2 
We define new variables x and y as follows: 
t=tth, and y = ttt, respectively. 


It can be shown that by differentiating M(¢:t2) and M(titets) to obtain the appro- 
priate partial derivatives evaluated at ¢; = 0 and substituting into the definitions 
of means and variances, we have: 


Me = Mille + on , 

C2 > Fispa “fe Co2h + 2uiMeoig + 011022(1 He Pi2) 

By = Kibkob3 + [1023 Sle M2013 ae M3012 ; 

Cov = FriMoMs + oooMios + Gastila + oro2ema(l + pie) 
at O40 33M 1 =n pia) + F220 33h (1 Hz a) 
“te 0110220331 si 2 pis at 2pis =i 2 pos + 8 P1213 P23) 
ste 2( 012i Moms ae O 13MM Ms = C231 Mods) 
a Hib2(B023013 =e 4012033) = Hits(6o12023 “le 4013022) 
+ ob3(Bo12015 oe 403011). 


704 H. M. C. LUYKX and BETTY L. MURRAY (office of the Surgeon General 
U.S.A.F.). Durations of Illness. 


Using epidemiology in the broad sense of the word, one of the epidemiological 
characteristics of a disease is its duration pattern. Not only the mean duration 
will vary from one diagnostic entity to the next, but so will the shape of the distri- 
bution of durations, which can be described in several ways. In this paper a picture 
is presented for each of a number of specific diseases. For example, graphs are 
shown for 368 cases of infectious hepatitis, 446 cases of mumps, 879 cases of pilonidal 
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cyst and 2177 cases of streptococcal sore throat. These and other charts illustrate 
the techniques of analysis and presentation, and the use made of the data. 

The curves are “‘cases—remaining” curves, which are strictly analogous to 
survivorship curves commonly given with life tables. A cohort begins its episode 
of illness at the zero point on a scale of days, and diminishes as each case terminates. 
The cases—remaining curves are more useful than ordinary frequency polygons 
(on a scale of duration). They show at a glance the range and skewness of the 
duration distribution, the modal day for return to duty, and the median and other 
percentile points on the “days-after-admission”’ scale. 

The Air Force medical data are unique in that they provide unusually large 
frequencies, derived from uniform records and recording procedures, showing di- 
agnoses established by a physician in each case. In this presentation, the-beginning 
of an illness is excusal from duty by a physician, and the end is return to military 
duty; cases not returned to duty (deaths or disability separations from the service) 
are not included. 

Definitions of illness, diagnostic entity, disability, admission, return to duty, 
etc., are also discussed, and a technical note describes the method of computing 
confidence intervals for percentile points. 


CLIFFORD J. MALONEY (U.S. Army Chemical Corps, Ft. Dietrick, Md.). 


705 
Disease Severity Quantitation, II. 


It was previously shown (Disease Severity Quantitation. Paper read at 
Washington, D. C. 1959 Annual Meeting of the American Statistical Association) 
that symptoms of rats exposed to aerosol challenge of B tularense conform closely 
to a scaling model due to Guttman (Stouffer, S. A., Measurement and Prediction, 
1950, Princeton U. Press). 

Necessary and sufficient criteria of scalability are that symptom pairs yield 
four-fold contingency tables in which one, but not both, of the secondary diagonal 
cells are zero, and that the zero cells in all tables can be arranged in a linear order 
in either symptom designator. 

The present paper points out that (in the absence of experimental accidents) 
the sum of the four cells in every table is fixed, that each table divides the entire 
disease severity scale into three intervals. In the first, the disease severity is too 
slight for either symptom to appear. In the second, the more sensitive symptom 
appears, but not the other, and in the last segment, the disease severity is such 
that both symptoms appear. The lower boundary is fixed by the milder symptom 
and is located at the same severity level whatever the other member of the pair; 
similarly for the upper level. Hence, a system of linear equations can be formed 
in the segment frequencies and in the segment lengths. 

These sets can be equated and expressed in matrix notation. If the experimental 
data is perfect, the matrix will be triangular. If the data are subject to random 
error, the further treatment can be carried out in terms of a matrix subject to random 
error. 


H. E. MCKEAN and B. B. BOHREN (Population Genetics Institute, Purdue 
706 University, Lafayette, Indiana. U. 8. A.). Numerical Aspects of the Regres- 
sion of Parent on Offspring. 


The relative merits of three methods of estimating heritability from the 
regression of offspring on one parent’s record, from the point of view of statistical 
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efficiency, are brought out with respect to five generations of a closed poultry flock 
previously reported by Yamada, Bohren, and Crittenden (Poultry Science, 1957). 
The three methods discussed are: (1) The regression of offspring means on parent 
records. (2) The regression of offspring records on parent records, in which the 
parents’ record is repeated once for each progeny. (3) The Kempthorne—Tandon 
weighted technique (Biometrics, 1953), in which account is taken of the correlation 
coefficient p between errors associated with progeny of the same parent. 

The latter technique (3) depends upon knowing p exactly; in this case the 
estimation technique is optimum. If p is unknown, p must be estimated or guessed ; 
in which case (3) is not optimum. The loss in efficiency in misguessing p is discussed, 
over a range of possible values of p. A comparison is simultaneously made with 
methods (1) and (2). 

The general conclusions, based on these data are: (i) method 1 is distinctly 
inferior to methods 2 and 3 in most conceivable situations; (ii) if p is small (which 
it is if the genetic variance is primarily additive and maternal effects are minor), 
then method (3) is only slightly more efficient than method (2). Indeed, method 
(2) is simply a special case of method (3) with p always guessed to be 0. 


HARLEY B. MESSINGER (University of California, Berkeley, Cal.). A 


10% Geometrical Model for Height-Weight Control Charts in Schoolchildren. 


The primary objective of this study was to obtain a satisfactory screening 
device, i.e., a way to give a yes-or-no answer at any time tothe question, ‘“‘Are this 
child’s height and weight within the ranges one would expect knowing his age, sex, 
and previous heights and weights,’’ To accomplish this, a set of tolerance limits 
were derived from the combination of cross-sectional studies of two functions of 
height and weight in one healthy group of children and longitudinal studies of 
these functions in each individual of a second healthy group with respect to the 
cross-sectional standards set by the first group. Because the functions were approxi- 
mately independent, separate control charts could be used. 

A secondary objective of the study was to seek interpretability for the control 
charts. In an interpretable chart, a child’s ratings at any time can be related di- 
rectly to his physical characteristics. A class of geometrically-meaningful functions 
arising from the use of a cylinder to approximate the human body was examined 
in the Institute of Child Welfare ‘“‘Guidance Study” data. The height (H) of the 
cylinder and the diameter of the cylinder relative to the height (D/H) were es- 
sentially uncorrelated within each age group from 4 to 14 years. Individual children 
of the “Berkeley Growth Study” were studied longitudinally. Their H and D/Y 
functions at each age were studentized to obtain standard- or z-scores. From 
studies of these sequences as stationary autoregressive time series, reasonably 
efficient control charts were obtained. The D/H function, called “relative broad- 
ness,”’ can be gotten from a nomogram. No arithmetic computations or standard- 
score tables are needed for this kind of chart. 


708 CHARLES J. MODE (Montana State College, Bozeman, Mont.). On The 
Theory of the Improvement of Metric Traits In Outbreeding Populations. 


An attempt was made in this paper to characterize the change in the mean 
and genetic variance of a metric trait under a program of improvement. The 
approach used in this paper differs from other approaches by carefully distinguishing 


ABSTRACTS 175 


between the fitness of a genotype under a program of improvement and the metric 
trait. Only the single locus case with an arbitrary number of alleles was considered, 
but the model may be interpreted in such a way as to include the multi-locus case. 

Steady state populations and the problem of stability of a steady state were 
also considered. The existence of steady states and the question of their stability 
was discussed for constant and non-constant measure of fitness. 

The conditions for the existence of an optimal value for the mean of a metric 
trait when two or more alleles are maintained in a population under a program of 
improvement were also given. 


W. 8S. OVERTON and A. L. FINKNER (North Carolina State College, 
709 Raleigh, N. C.). The Sample Road-Block Method of Estimating Hunting 
Pressure. 


The sampling method described was developed for use on Game Management 
Areas in Florida, but is applicable in situations where estimates of hunting pressure 
are desired for areas having few or no inhabitants. The method involves the sampl- 
ing of access points. 

Under a given definition, it is possible to express ‘‘hunting pressure” in terms 
of total persons entering and leaving the area by time period. An expression is 
developed for the definition used in Florida, which forms the structural basis for 
sampling. Total hunting pressure is divided into a daily component and a com- 
ponent due to camping; only the former is considered in this paper. 

The daily component of hunting pressure is a simple linear combination of the 
time period totals of entries and exits. There are a number of sampling systems 
that will give unbiased estimates of the totals by time periods, and, hence, unbiased 
estimates of the daily component. A system in which an observer measures a 
randomly selected station at alternate time periods with the starting time randomized 
among days was chosen because of its simplicity and ease of operation. Time periods 
are considered strata, and the usual stratified sample estimates of totals and variances 
are employed. This system does not yield unbiased estimates of variance, but 
seems to be satisfactory for present purposes. 

Alternative sampling systems and corresponding estimates of variance are 
considered and evaluated. 


D. S. ROBSON (Cornell University, Ithaca, N. Y.). Cumulant component 
analysis in balanced designs. 


Cumulant components, as a direct extension of variance components, described 
the manner in which the usual assumptions on the linear model are violated. The 
unique, minimum variance unbaised estimators of cumulant components for the 
distribution-free, balanced model are easily determined because of the existence 
of a complete, sufficient order statistic. One computational procedure for con- 
structing these estimates is an algebraic extension of the analysis of variance to the 
“analysis of cumulants’’. 

Under the usual Model II assumptions of normality and independence of effects, 
the components of all cumulants beyond the second are zero. Tests of departure 
from zero, therefore, constitute tests of the usual assumptions. Several such tests 


are available. 
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B. V. SHAH (Iowa State University, Ames, Iowa). Asymmetrical Factorials 


711 
in Incomplete Blocks. 


The primary purpose of this paper is to summarize the recent developments in 
this subject and to indicate possible lines of attack for future research. After a 
brief summary on history of developments, the desirable properties of factorial 
experiments, namely orthogonality and balance are discussed. The analysis and 
the construction of the designs with above properties are studied. As a consequence 
of relaxing the condition on balance we obtain a wider class of designs, which may 
be called partially balanced factorial experiments. Finally, the use of pseudo- 
factors and some other aspects of these designs are studied. 


R. G. D. STEEL (Cornell University, Ithaca, N. Y.). A Variable Rank 


12 
‘ Sum Multiple Comparisons Test. 


A variable rank sum is used for the comparison of all pairs of treatments. This 
sequential method of testing should be more powerful than a procedure using a 
fixed rank sum and an experimentwise error rate. Computation of exact probabilities 
and appropriate tables is discussed. 


713 PATRICK SUPPES (Stanford University, Stanford, Cal.). Foundations 
of Measurement. 


This paper is concerned with the axiomatic foundations of measurement. The 
procedures and operations of measurement can be represented as a relation algebra 
in the sense of Tarski. The two formal theorems which demonstrate the adequacy 
of a particular scheme of measurement are the representation theorem which estab- 
lishes that the algebra of empirical relations can be mapped homomorphically into 
the appropriate structure of the real numbers, and the uniqueness theorem which 
establishes that the homorphism is unique up to the appropriate group of transfor- 
mations. Some empirical examples of these ideas are given and some problems of 
axiomatizability are mentioned. 


WILLIAM F. TAYLOR (University of California, Berkeley, Cal.) and 
714 JOSEPH BERKSON (Mayo Clinic, Rochester, Minn.). A Problem of Testing 
and Estimation Involving Four Fold Tables. 


In this paper, the old problem of the analysis of four fold tables is re-examined 
with respect to a peculiar kind of sampling. Assuming a population possesses 
properties A and B, consideration is given to several ways in which samples might 
be taken in order to determine whether A and B are associated. 

Case 1. Simple random sampling. MN members of the population are drawn at 
random and the frequencies of occurrence of A, B; A, not B; not A, B; and not A, 
not B are observed. This is the classical, simple sampling procedure. 

Case 2. Sampling with respect to A. By this is meant the random and inde- 
pendent selection of N; members with property A and No = N — N,; members 
without property A. N; and N» are fixed in advance and the number of cases of 
B and not B are observed in each of these two groups. 

Case 3 is the reverse of case 2, namely, sampling with respect to B. 

In case 4, estimates of various parameters are found by one of Neyman’s methods 
for obtaining best asymptotically normal estimates. A more recent method involving 
a logistic transformation is also used. The powers of several related x? tests are 
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compared with the result that tests based on sampling by case 2 or 3 above have 
higher asymptotic power than one based on case 1 sampling. Case 4 lies between 
case 2 and case 3 in this regard. 


ROBERT M. THORNER and QUENTIN R. REMEIN (U. 8. Public 
715 Health Service). Some Aspects of Screening Tests for the Detection of 
Disease Suspects. 


The screening of large population groups for chronic disease in a growing health 
department activity. The purpose of screening is the selection, for diagnostic 
evaluation, of persons in the population with a high probability of being diseased. 
The application of screening tests requires a basic knowledge of the attributes of 
these tests and an understanding of the results that may be anticipated when a 
particular test is applied to a population group at a selected screening level. 

Tests used in screening must be simple and inexpensive; should be sensitive, 
specific, accurate, and precise. The theory of overlapping distributions is used to 
explain sensitivity and specificity. The stability of these measures and their relation- 
ship to prevalence is discussed. The effect of altered screening levels on sensitivity 
and specificity is illustrated. 

_Problems of selecting screening levels are discussed, together with the 
relationship of sensitivity, specificity and prevalence to the frequency of false nega- 
tive and positive test results. Youden’s test evaluation index is discussed in relation 
to comparing test results with chance. 

The static and stochastic models of screening tests are considered, and ‘‘relative’’ 
sensitivity and specificity are reviewed. The calculation of confidence limits for 
sensitivity and specificity percentages is illustrated, and statistical techniques used 
for comparing results of different screening techniques, for correlated and uncorre- 
lated models, are reviewed and illustrated. ‘Precision’ and “accuracy” are defined, 
and the measurement of precision and comparison of precision measurements is 
illustrated. 


R. M. THRALL (University of Michigan, Willow Run, Mich.). A Review 


716 
of Mathematical Aspects of the Theory of Measurement. 


The past fifteen years have witnessed a vast advance in the techniques and 
theories of measurement. Methods and theories appropriate to the physical sciences 
have been extensively generalized so as to become more applicable to the behavioral 
and life sciences. The present paper is limited to a review of certain mathematical 
systems which either have been used in measurement or show promise of such use. 

Measurement in the physical sciences centered on description of some attribute 
of a physical object by a real number. S. 8. Stevens pioneered in generalizing this 
concept of measurement, and his work stimulated still further generalization and 
abstraction. 

A fundamental point of view in modern measurement theory is that the mathe- 
matical system used for measurement should be no stronger than is justified by 
the properties of the objects being measured. For example, use of the real numbers 
to measure hardness is not justified since neither addition nor multiplication has 
meaning relative to hardness. All that is physically meaningful is that relative to 
hardness objects can be placed in order with ties permitted. Thus for the “degrees 
of hardness” only a chain order is mathematically justified. The presence of ties 
in hardness indicates that a weak order is suitable for the objects themselves. 
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The present paper is devoted to a review of the development during the past 
fifteen years of mathematical models appropriate for measurement. These models 
vary in the strengths of the axiom systems used and in the extent to which stochastic 
theories are employed. Most of the early systems are entirely deterministic; later 
systems are increasingly stochastic in nature. 


M. B. WILK and R. GNANADESIKAN (Bell Telephone Laboratories, 
717 Murray Hill, N. J.). Some Remarks on Plotting Procedures in the Analysis 
of Experiments. 


The paper deals with certain problems of estimation, associated with plotting 
procedures in analyzing experiments, using rank order statistics. A detailed discus- 
sion of the half-normal plotting procedure in this connection is included. 


The following are abstracts of papers presented at The Symposium on Quantitative 
Methods in Pharmacology held in Leyden, The Netherlands, May 10-13, 1960. 


718 J. HAJNAL (London School of Economics, London, England). Sequential 
Tests on Analgesics in Rheumatoid Arthritis. 


The paper is concerned with a series of clinical trials carried out in the 
Rheumatism Research Clinic of the Royal Infirmary, Manchester. The background 
and trial procedure are briefly described. The statistical analysis appropriate to 
this procedure, if sequential methods are not used, is compared with the sequential 
method in fact employed. The advantages and disadvantages of incorporating 
this sequential method in the design are discussed and some possible improvements 
in the sequential analysis are indicated. 


719 N. L. JOHNSON (University College, London, England). On the Choice 
of a Sequential Test Procedure. 


1. In the construction of a standard sequential probability ratio test (s.p.r.t.) 
procedure we need to define (i) two pivotal hypotheses Ho , Hi , (ii) two (approxi- 
mate) probabilities of error ao , a1 , such that 


Pr {reject H; | H;} a; G= 1,2). 


It is known that the s.p.r.t. is then optimal in the sense that among all procedures 
satisfying (ii) the s.p.r.t. has (nearly) the smallest expected size of sample whether 
either Ho or H is true. 

2. Conditions (i) and (ii) are the same as those often used in determining the 
size of sample to be taken in a classical fixed sample procedure. In such cases the 
hypotheses Ho and H, (and the probabilities a») and ai) are chosen in the light of 
the required sensitivity of the procedure. For example Ho may be a ‘null hypothesis’ 
(and ao a ‘level of significance’) corresponding to a desirable level of quality, with 
H, representing a change in quality of such an amount that a high probability 
(1—a1) of detecting it is needed. 

If Hy and H, , determined in this way, are used to define a s.p.r.t., the full 
advantage of the sequential method may be lost unless either Ho or H, is true 
(or nearly so) in a large proportion of cases. When a hypothesis H, different from 
both Ho and H,, is true there is no guarantee that the s.p.r.t. procedure defined 
by Ho, Hi, ao, a: will even have a lower average sample size than a fixed sample 
procedure defined in the same way. 
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3. As.p.r.t. satisfying (ii) need not necessarily be based on Hy and H, as pivotal 
hypothesis. It is suggested that consideration should be given to the introduction 
of additional hypotheses Hj , H{ , to be used as pivotal hypotheses. Hé and H{ 
would be chosen to represent the sort of conditions expected to be encountered 
rather frequently—so that the saving in average sample size should be as effective 
as possible. The associated (approximate) probabilities of error af , @; (where 
Pr {reject H}/H/} & a! ;j = 1, 2) are then determined by conditions (ii). 

4. In this paper detailed consideration is given to the case when the s.p.r.t. is 
regarded as a test of the ‘null hypothesis’ Ho , and where we take H ' = Hy). Some 
tables are given to aid in the construction of the test procedures, and also to describe 
the properties of such tests. 

5. Incidentally to the main topics, some results are obtained on the conparison 
of operating characteristics (or power functions) of comparable s.p.r.t.’s. 


CHR. L. RUMKE (Free University of Amsterdam, Neth.). An Efficient 


720 
Design for Comparing the Effects of Two Treatments. 


The variability of the reactions of groups of animals to drugs often interferes 
with an efficient comparison of the effects of two treatments with all or none re- 
sponses. In classical HD; -determinations it is difficult to choose such dose-levels 
that the inefficient 0- or 100% effects are avoided. A design is to be described which 
leads to an efficient test for the difference between two treatments at an optimal 
dose level. A rough estimate of the ZD50’s can be made by slightly modifying this 
design—though with some loss of efficiency. 


K. L. SMITH (Boots Pure Drugs, Nottingham, England). Sequential 


721 
Analysis Applied to Biological Control Tests for Pharmacopoeial Substances. 


During some biological examinations to control the quality of pharmacopoeial 
substances, sufficient evidence may be obtained at one time to satisfy the official 
requirements and the results from such examinations may be analysed by the tradi- 
tional methods. 

In some examinations, such as those to exclude undue amounts of pyrogens, 
the evidence collected at one time may be sufficient to accept or reject the material 
in the extreme cases. For intermediate cases repeat tests are permitted and the 
._ criteria for them may be conveniently derived from a sequential sampling plan. 

In a large number of tests the evidence must be collected sequentially either 
as single responses as in the assay of Digitalis using guinea pigs or pigeons, or as 
the results from repeated tests which individually are too imprecise as in the assay 
of Insulin. Both circumstances lend themselves to the application of sequential 
analysis. In both cases where one has set out to make a certain number of observa- 
tions before applying traditional methods of analysis, the value of using a sequential 
plan may be one of economy only. In those cases where testing is continued until 
the sample may be considered satisfactory, it is essential that the amount of repeat 
testing should be sufficient to satisfy a sequential sampling plan. 


D. J. MEWISSEN and E. H. BETZ (l’Universit. de Liége, Belgium). Se- 
quential Tests in Protective Effects of Cystamine and Chlorpromazine. 


A simple method based on sequential test has been used by Krmpatu and al. 
(Radiation Research, 7, 1, 1957) for screening compounds in radiation protection. 
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Under specific conditions, the method is easy to apply and the decision of accepting 
or rejecting a prospective action of protection is based on the difference in animals 
surviving in the treated and control groups. 

The effectiveness of protective agents widely varies with the delay between 
administration to animals and exposure. The responsible factors and the under- 
lying mechanisms are poorly understood. Therefore, we investigated variations in 
the protective action of cystamine and chlorpromazine versus delay in X-irra- 
diation of treated animals. Usual sampling methods would have been unpractical 
and expensive as existence of a protective effect was questionned under various 
time specifications. Sequential tests were used. A 60% survival in the treated 
group was assumed a meaningful increase over 50% survival in the control group. 
Maximum risks for both kinds of errors were set at 10%. Tests were arbitrarily 
stopped after five steps. Decision was usually reached after three steps. 

In a first series of experiments, groups of ten animals were used. In treatment 
groups, chlorpromazine (5 mg) was injected intraperitoneally to adult pure inbred 
rats (average body weight: 180 g), respectively 2, 3 and 4 hours prior to total body 
irradiation (750 r). In a second series of experiments, groups of 5 animals were 
used. Cystamine (100 mg/100 g of body weight) was injected intraperitoneally 
to rats, respectively 2, 10, 20, 30 and 45 minutes prior to irradiation (900 r). In 
each instance, a sequential test was performed on successive groups. In addition. 
mortality in all groups was recorded for 30 days and mortality curves were drawn. 
In all cases, decisions reached at an earlier stage with sequential tests were confirmed 
by further examination of the completed mortality curves. 


723 E. W. PELIKAN (Boston University, Boston, Mass., U. S. A.). Dose Meta- 
meters in Comparative Pharmacology. 


Both inter-species and intra-species comparisons of sensitivity to drugs of 
common mammals, including man, may be facilitated by expressing dose as a 
power function of body weight. Determination, from published data, of the con- 
stants in the equation [log mean total dose (mg) = 6 log mean body weight 
(gm) + a] yielded the power function best suited to inter-specific comparison of 
potency of drugs of three classes. When paralysis was the criterion of effect, b for 
tubocurarine, its dimethylether, benzoquinonium, gallamine, and succinylcholine 
did not differ significantly from 1.0; for decamethonium, 6 was 0.60, significantly 
less than 1.0. When anesthesia was the effect, b for amobarbital, pentobarbital and 
secobarbital was about 0.80 and b was independent of route of parenteral drug 
administration. When acute death was the effect, b was 0.83 for ouabain, 1.01 for 
digitalis tincture. In contrast, analogous values of b for digitoxin, digitalis, ouabain 
or Lanatoside C (computed using data for total lethal doses and body weights of 
individual members of a given species) varied from 0.52 to 0.89 (median: 0.70) for 
these drugs in each of several species. Minimizing inter-species variance in potencies 
of drugs, precision in estimating and predicting drug potencies among species or 
individuals, and detection of species differences in sensitivity to drugs may be 
furthered by use of an appropriate dose metameter. 


724 P. J. CLARINGBOLD and C. W. EMMENS (University of Sydney, 
Australia). Biological Assays Involving Quantal and Semi-quantal Responses. 


Various transforms have been used for interpreting the results of biological 
assays and other experiments when the response is quantal. Usually, intervals 
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are studied in which transforms such as the probit, logit or angular show no per- 
ceptible differences, and it is a matter of practical convenience which is used. The 
angular transform is often the most useful, as it has equal weights at all response 
levels. 

In much early work, few dose-response lines had to be calculated, and the 
design of tests was simple. More and more frequently however, screening trials or 
complex designs require methods which offer rapid analysis, even if approximate. 
Also, within-animal quantal assay will usually yield few responses from each animal 
and the application of even the angular transform becomes extremely complex. 
In such circumstances, analyses of variance of the untransformed data (scored 
0, 1 or 0, 1, --+ ) is defensible and gives rapidly computed estimates which show 
little loss of information and allowed of up to a 4-fold increase in precision in cross- 
over assays with mice. In factorial tests, probit or even angular transformation 
leads to formidable computations if many factors are involved, whereas analysis 
of variance of the untransformed data is simple and remarkably accurate in practice. 


725 R. VAN STRIK (Philips-Duphar, Weesp, Neth.). A Simple Method of Esti- 
mating Relative Potency and its Precision from Semi-quantitative Responses. 


A simple procedure is described for evaluating those bio-assay experiments 
where the observed responses are scored in a semi-quantitative scale, such as =, 
+, ++, +++. The initial responses are ranked together in order of magnitude 
and a potency ratio with its confidence limits are calculated from the ranks using 
customary bio-assay computational methods with just one minor alteration. The 
method is outlined in an example and its reliability is investigated. 


CLAUS RERUP (The Royal University, Lund, Sweden). On the Validity 


726 
of a Cross-over Assay. 


Cross-over assays are valid only, when the following criteria are fulfilled: 

1) The assay design has to be balanced, which means that a set of observations 
always has to be obtained not only in a certain order, but simultaneously in the 
reverse order because of possible changes in the magnitude of reactions between 
treatments within subjects. 

2) No significantly different changes in Penean between groups of subjects 
_ are allowed to occur from treatment to treatment. 

For the twin and triplet-cross-over assay (insulin, corticotrophin) it is shown 
that the hitherto applied validity tests (in the twin cross-over assay the test for 
parallelism of the dosage response lines for standard and test, in the triplet cross- 
over assay extended by a test for curvature of the mentioned lines) are not appropri- 
ate, because group differences may cause apparent significant differences in slope. 
Group differences are, however, eliminated in the cross-over analysis. An example @ 
of a twin cross-over assay on insulin is given, which must be regarded as invalid 
according to hitherto published validity criteria, which, however, is a good and 
valid assay according to the essential validity criteria to be presented in the paper. 


J. HEMELRIJK (Technological Institute, Delft, Neth.). Experimental 


ae Comparison of Student’s and Wilcoxon’s Two Sample Tests. 


The choice between the many available statistical methods is often difficult to 
make in practical situations. It is often—too often—based on personal preference 
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without sufficient regard for the merits of the different methods available. A lack 
of knowledge about the small sample properties of many distribution-free methods 
tends to make their position uncertain in comparison to classical methods, based 
on specified suppositions about the sampled populations; the properties of the latter 
are known, if these suppositions are true. 

In a small experiment, using QuENOUILLE’s (Biometrika, 46 (1959), 178—204) 
50 independent pairs of samples of size 10 were used to compare STUDENT’S and 
Witcoxon’s two sample tests. 

The samples were first taken from a standard normal distribution (QUE- 
NOUILLE’S 1), then ene of them was shifted over distances of 0.81; 1.05 and 1.53 
to the right. These shifts correspond to a power for SrupEN?’s test of 0.40; 0.60 
and 0.90 respectively. In all these positions both tests were applied onesidedly 
with level of significance 0.025. As was to be expected—Srupmnv’s test being 
uniformly most powerful in this situation—the power of Witcoxon’s test was the 
smaller one. The results indicated that in this case one would not be far off the 
mark in stating that, roughly, the loss of power was about 10% of the power of 
Srupunt’s test as long as the latter is not too close to 1. (This also holds under the 
hypothesis tested: the true level of significance is for SrupENT’s test 0.025 and for 
Witcoxon’s test, owing to its discrete character, 0.022 in this case.) 

A second point of investigation was the reaction of the tests to an observational 
error in one of the original observations. For each pair of samples an observation 
was chosen at random from the sample with the larger median and this observation 
was shifted over a distance 2 to the right. In accordance with expectation STUDENT’S 
test proved to be far more sensitive to this mild kind of slippage than W1Lcoxon’s 
test. 

Finally all observations, transformed to observations from a rather skew ex- 
ponential distribution (QUENOUILLE’S x4), Were again tested in the same way. The 
shifts were in this case replaced by multiplication of the observations of one of the 
samples from the starting point of the distribution, in order to keep this point 
invariant. The factors were: 1 (identical distributions); 2; 3 and 4.5, resulting 
in power function values of roughly 0.025 (the level of significance); 0.25; 0.45 and 
0.70 respectively. The two tests were now approximately equally powerful. 

All these results are dependent in a statistical sense, being based on the same 
1000 observations. Also the investigation was too small (although laborious) to 
warrant strong conclusions. It is only the beginning of more extensive research 
with more powerful means. 


II. DE JONGE (Netherlands Institute of Preventive Medicine, Leyden, Neth.). 
728 The Influence of Non-normality on the Significance Levels of Student’s Two 
Sample Test. 


The paper describes an empirical sampling study of the distribution of Student’s 
t for small samples taken from four populations with non-normal distributions, 
respectively showing: A. Marked skewness, B. Skewness with a bunch of outliers, 
C. Leptokurtosis with skewness, D. Platykurtosis with skewness. These types 
of distributions were chosen, because they were frequently seen in practice. 

From each population pairs of random samples of 5, 10 and 20 elements were 
drawn and the values of t were computed. For each of the populations and for 
each sample size the empirical sampling distribution of ¢ has been compared with 
the corresponding Student-distribution. 
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729 H. R. VAN DER VAART (University of Leyden, Neth.). On the Robustness 
of Wilcoxon’s Two Sample Test. 


It has long been known that the probability of Type I error of Student’s two 
sample test is subject to appreciable variation in the case of unequal population 
variances (Hsu, 1938, Stat. Res. Memoirs; Chand, 1950, Ann. Math. Stat.). There- 
fore it is interesting to know the behaviour of Wilcoxon’s two sample test in this 
respect. Van der Vaart (1950), Indag. Math.) found in a special case for very small 
samples that Wilcoxon’s test is less sensitive to inequality of population variances 
than is Student’s tests, Kruskal and Wallis (1952, Journ. Amer. Stat. Assoc.) put 
forward an argument which lends plausibility to the conjecture that Wilcoxon’s 
test may be fairly insensitive to differences in variability, and Hodges and Lehmann 
(1956, Ann. Math. Stat. p. 327-329) show that Wilcoxon’s test is often less powerful 
against contamination with a shift than is Student’s test. All this might lead one 
to expect that Wilcoxon’s test would in general be less sensitive to the above-men- 
tioned inequality of variances than is Student’s test. Now this turns out not to be 
uniformly true. For instance, in the extreme case where one of the population 
variances is zero, the relation between the sample sizes determines which test. will 
behave worst in this respect. In the general case the situation is rather complicated. 


I. JUVANCZ (Math. Institute, Hungarian Academy of Science, Budapest, 
730 Hungary). Contra Indication of Non-parametric Tests in Medical Experi- 
mentation. 


According to the author’s personal experience from the design and/or analysis 
of over 700 experiments (most of them medical), the statisticians engaged in medical 
experiments show a reluctance to use non-parametric methods, though numerous 
theorists enthusiastically advocate them. A statistical survey is made of papers 
from some medical, biometrical and statistical periodicals, published in various 
countries. The results show the same tendency as the author has noted at home. 

The reluctance of those “qualified in medicine or, if not so qualified, at least 
well soaked in it’’ is caused by some characteristic features of medical experimen- 
tation, well known to them. So e.g. in medical experiments the interest is mainly 
centered on the estimation of mean values and their fiducial limits, and on material 
significance. There is great variation between and within individuals; the measure- 
ments are poor; the number of observations is small. Experiments are sometimes 
dangerous. Numerous factors disregarded in the statistical analysis influence the 
flnal decision. Experimental results are retested several times by others before 
they are accepted. The results are often intended for individual use (e.g. diagnostic 
test). Ete. etc. 

These characteristics compel the researchers to use statistical methods which 
yield results convertible to the original measure (e.g. kg, cm). Order statistics 
and the x2-tests are not of this kind. Further, to minimize the loss of information 
non-parametric methods are too wasteful. Consequently non-parametric tests are 
economically inefficient both qualitatively and quantitatively. 

Not rarely are non-parametric tests insensitive to changes regarded medically 
substantial and too sensitive to insubstantial ones. (E.g. x?-tests are insensitive to 
consistent though little increasements and sensitive to great though inconsistent 
ones; this is not the case with the ¢-test.) If is often impossible to decide what 
caused the statistically significant change indicated by a non-parametric test (mean, 
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type of distribution etc.). On the other hand, tests of the mean are very “robust’’ 
against non-normality. 

Only s small field of application is left for non-parametric methods: the cases 
where the experimental techniques are too weak; e.g. when the results are only 
given in rank-numbers as sometimes in psychometrics. 

Consequently, the use of non-parametric methods should be avoided, if necessary 
by the invention of better experimental (e.g. chemical) methods. 


731 L. J. GOLDBERG (University of California, Berkeley, Cal., U. S) A.)e 2A 
Visual Statistical Approach to Bio-assay. 


In quantal bio-assay one usually starts with the following information: 

‘A series of doses (di, d2, +++ , di) has been administered to (Na, Noga eee 
test hosts respectively, and the corresponding reactors (r1,72,°°* , 7) are observed 
at each test dose level’. 

Based on a set of such experimental observations, a visual procedure will be 
presented for estimating the true dose response. The proposed procedure is based 
on the concept that a single experimental observation should be considered as a 
triplet (r, N, d). The entire set of experimental observations may be considered 
as follows, in triplet notation. 


(r1, Ni, d1), (r2, Nz, de), ie (tegen Gee 


The problem of estimation is reduced to the operation of ‘triplet addition’ 
conditional to an assumed dose response model. The rules for triplet addition 
will be illustrated in detail for the single hit dose response model. The extension 
to other-dose response models will be considered, including the logit and probit 
as specific examples. 


732 CONSTANCE VAN EEDEN (Mathematical Centre, Amsterdam, Neth.). 
On Distributionfree Bio-assay. 


In bio-assay the following situation is considered: a stimulus (e.g. a drug or a 
vitamin) is applied to a subject, resulting in a response produced by the subject. 
In this paper we only consider the quantal (all-or-nothing) response. Then for 
each subject there will be a level of intensity of the stimulus above which the re- 
sponse occurs and below which it does not occur; this level is called the tolerance 
of the subject. On a population of subjects this tolerance will be a random variable 
, with distribution function F(A), say. The problem in bio-assay is to give an 
estimate of this distribution function F(\). A second problem is to compare the 
distribution functions /',(d) and F.(d) for two different stimuli. The observations 
then consist of the results of applying the stimulus once to each of a number of 
subjects, at several doses. 

The abovementioned problems are solved for the parametric case, e.g. when 
» has a normal distribution; the methods given for this situation are called “‘probit 
analysis’? and are described e.g. by D. J. Finney (1947). 

The application of this method, however, requires in some cases laborious 
computational work. Moreover difficulties arise if, for one or more doses the number 
of observations is very small or if none of the subjects or all subjects give a response. 
Finally the assumption of normality may not be fullfilled. 

This paper contains a description of what may be obtained if no assumptions 
are made on the form of the distribution of }. Let observations be available at 
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k different doses d; , +++ , dx with dy < +++ < d,. The number of observations at 
dose d; is denoted by n; and the number of subjects giving a response at dose d; 
is denoted by a; . We suppose all observations to be independent. Let further p; 
denote the probability that a subject gives a response at dose d; then pi = F(d,); 
consequently, F(A) being a monotone non-decreasing function of A, we have 
PESOS pee 

For this situation maximum likelihood estimates of p; = F(d;) may be obtained 
by means of a method described in the thesis of C. van ExpEN (1958). In this 
thesis a more general problem is considered, where k parameters of k distribution- 
functions are to be estimated, if it is known that these parameters are partially or 
completely ordered and moreover confined to given intervals J;. If the distributions 
are binomial, the ordering is complete and J; is the interval [0, 1], we obtain the 
above mentioned problem of estimating probabilities p: , --: , px satisfying 
Pics "7's SS pe. 

These maximum likelihood estimates may be obtained in a simple way and it is 
not necessary to have a large number of observations at each dose. Thus a maxi- 
mum likelihood estimate of F(X), at the doses used in the experiment, is obtained 
without any assumptions of a parametric nature about the distribution function 
F(A). 


References 


Van Expen, C. (1958), Testing and estimating ordered parameters of probability 
distributions, Thesis Amsterdam. 


Finney, D. J. (1947), Probit Analysis, Cambridge University Press, Cambridge. 


C. W. DUNNETT (University of Aberdeen, Scotland). Statistical Theory 
of Drug Screening. 


Screening a chemical compound for some specific form of chemotherapeutic 
activity is considered as a decision problem in which the terminal decisions open 
to the investigator are either to accept the compound as being worthy of further 
investigation or to reject it as being of no interest and the unknown “state of nature” 
is the degree of activity of the compound. A truncated sequential procedure for 
screening is proposed, in which rejection can occur at any stage but acceptance is 
allowed only at the final stage. The particular case in which only two levels of 
activity exist (called ‘active’ and ‘‘inactive’’) and all compounds have the same 
a@ priori probability of being active is considered in detail. A criterion of optimality 
is introduced, which involves the cost of testing and the costs associated with wrong 
decisions. A method for computing the critical rejection levels when the testing 
errors are normally distributed is given, and illustrated for one, two and three- 
stage procedures. Extensions to cases in which some compounds are more likely 
to be active than others, and in which more than two levels of activity may occur, 
are discussed. 


M. A. SCHNEIDERMAN (Cancer Research Institute, Bethesda, Md., 
734 U.S.A.). Statistical Problems in the Search for Anti-cancer Drugs by the 
National Cancer Institute of the United States. 


The development of the external (contract) screening program of the Cancer 
Chemotherapy National Service Center is described. The basic experiment is a 
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joint toxicity-therapeutic trial. This permits the administration of maximum 
tolerated doses without the need for extensive prior toxicity testing. The basic 
screen uses three different mouse tumors with each tumor system examined in- 
dependently. A multi-stage sequential scheme is exployed; a two-stage scheme for 
‘natural’ products and a three stage scheme for synthetic chemicals. The operating 
characteristic curves for the two systems are compared, The acceptance and re- 
jection levels have been so adjusted that the two systems yield the same number 
of false positives, while the two-stage scheme yields more false negatives. This 
can be tolerated because of the nature of the natural products coming to the screen. 
The actual behavior of one of the screens is appraised by examining the responses 
to a “known positive’ compound, tested by different screeners in different places 
at different times. 

Problems of mechanical data processing, the use of additional screens beyond 
the basic three, secondary screening, optimal dose-finding, ‘positive’ and “nega- 
tive” controls, possible deterioration of natural products, and some possible non- 
parametric approaches are considered, as examples of unsolved problems. 


J. J. GQRIMSHAW and P. F. D’ARCY (Allen Hanburys, Ltd., Ware, England). 
735 Some Problerns Arising in Drug Standardization and the Selective Screening 
of Compounds of Potential Pharmacological Interest. 


During the course of routine standardization and screening procedures a variety 
of problems have been encountered. Examples of in vivo and in vitro techniques 
are given, in which the assays of purgatives and succinylcholine together with the 
screening of potential analeptics and barbiturate potentiating drugs, antihistamines 
and neuromuscular blocking agents are considered. Solutions to some of the prob- 
lems are proposed and the need for careful statistical control throughout is em- 
phasised. It is suggested that, in some cases, insistence on statistical rigour may 
lead to results of diminishing practical importance, and the implications of this 
are discussed. 


NORBERT BROCK (Asta Werke, Brackwede, Germany) and BERTHOLD 
736 SCHNEIDER (Karlsruche, Germany). Pharmacological Characterization 
of Drugs by Means of the Therapeutic Index. 


For the pharmacological characterization of drugs it is not sufficient to describe 
their different qualities of action as such; in order to obtain a clear picture of the 
value and dangers of any substance, i.e. of its margin of safety, it is necessary to 
plot one of its actions against the other. As a measure of the margin of safety we 
introduced the therapeutic index (DL 5/DC 95 or DL 5/DE 95) which expresses 
the relation between the curative and toxic doses. Based on more than 10 years’ 
experience, the author discusses some practical examples in various fields of pharma- 
cotherapy. The trouble in applying the therapeutic index is the difficulty in ob- 
taining approximate values for the deviations of the index and in having suitable 


testing methods available. Both problems are discussed and suggestions made 
for practical use. 


737 E. J. ARIENS (University of Nijmegen, Neth.). Analysis of the Action of 
Drugs and Drug Combinations. 


As a rule the determination of biological activities of drugs aims at the com- 
parison of the results obtained with various drugs or with one drug under different 
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circumstances, as for instance different doses. The experiments must therefore 
be based on a sound design and the experimental results evaluated with the aid 
of statistical methods. A primary question remains, however, whether the effects 
or magnitudes that are compared are really comparable in the sense meant by the 
investigator. This is especially of importance if the experimental results serve 
the study of relations between the chemical structure of drugs and their biological 
activity. Based on the result of such studies a more direct design of new drugs, 
with a minimum of trial and error in the research procedure, might be possible. If 
experimental data are compared, an analysis of the biological effects studied is 
necessary in order to find out if, to what degree and in what sense these data are 
comparable. This will be demonstrated by some examples. 


S. DIKSTEIN (Hadassah Medical School, Jerusalem, Israel). Physical 


738 
Chemistry of Drug-receptor Interaction. 


1) The effectivity in vitro of homologous series of aliphatic choline esters, 
straight chain aliphatic primary amines and straight chain aliphatic monoquater- 
nary ammonium compounds was determined on the small intestine of guinea pig. 

2) Morphine, an inhibitor of the acetylcholine release, inhibits the contractions 
caused by small doses of nicotine, but has no effect on the contraction caused by 
acetylcholine. This phenomena is similar to the inhibition caused by botulinum 
toxin, which is known to act at the nerve ending. Since morphine has no effect 
on the contractions caused by any of the homologous amines tested by us, it was 
concluded that the primary amines and monoquaternary ammonium ions act 
directly on the smooth muscle. 

3) In the homologous series, the activity increases with increasing number of 
methylene groups up to a certain limit, the further addition of a single methylene 
group then abolishes the contractive power or makes the compound inhibitory, 
Addition of more methylene groups first increases the inhibitory power and then 
decreases it. 

4) A qualitative explanation is offered to explain the observed facts. According 
to this hypothesis in the presence of a positive charge, if the solubility parameter 
of the compound is higher than that of the biophase we get excitation, if the solubility 
parameter is lower we obtain inhibition. The smaller the difference, the greater the 
effectivity. On the molecular level the excitation is explained by other workers 
' by the interaction of materials with monolayers. According to this work it is 
postulated that if the material has a lower solubility parameter than that of the 
biophase it will dissolve in it increasing the spreading force. A material with higher 
solubility parameter will decrease the spreading force. 

5) Since the excitation is a kinetic phenomenon (the velocity of addition of 
the excitant determines the activity) whereas the solubility parameter represent 
in fact a term of energy, it is not difficult to explain the absence of quantitative 


predictions. ; Assn 
_6) The importance of the solubility parameter in other drugs is discussed. 


739 S, E. DE JONG (University of Leyden, Neth.). Isoboles. 
In this lecture reference is made to Loewe’s method, by which the pharma- 
cological effect of two drugs in combination is diagrammatically represented by 


means of isoboles. 
A definition with a short description is given, followed by a survey of the most 
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important types. Some special cases are discussed more fully. It is explained that 
some forms of synergism do not lend themselves to being represented by means 
of isoboles (allobiotic synergism; inversion). Not only because of this do the isoboles 
not yield an optimal solution to the pertaining problem. It may be considered 
objectionable that it is necessary to estimate the exact amounts of a series of mixtures 
of two drugs needed for a quantitatively specified effect. Besides, the isobole thus 
obtained even then gives only incomplete information; for stronger or weaker effects 
an entirely different type of isobole can be found. 
In conclusion Loewe’s rather complicated nomenclature is enumerated. 


P. 8. HEWLETT and R. L. PLACKETT (Pest Infestation Laboratory, 
Slough, England). Models for Quantal Responses to Mixtures of Two Drugs. 


The various previous approaches to the construction of mathematical models 
for quantal responses to mixtures of drugs are briefly referred to, and it is pointed 
out that models for graded responses to mixtures cannot in general be transferred 
direct to the interpretation of quantal responses. A general method for developing 
models for quantal responses to mixtures is put forward. This is exemplified by 
development of a model for simple similar joint action, i.e. a situation in which 
two drugs have a common site of action, but in which neither modifies the behaviour 
of the other at their site of action or elsewhere. 


741 J. GURLAND (Iowa State University, Ames, Iowa, U.S. A.). Determination 
of Minute Insecticidal Residues through Biological Assay. 


Samples of alfalfa from a field sprayed uniformly with Guthion, were collected 
1 day, 3 days, 7 days, 2 weeks, 4 weeks after spraying. The residue in these samples 
was determined by means of a biological assay in which houseflies were exposed to 
increasing doses of the insecticide. Since sample extracts contained lipids and other 
materials which caused a masking effect, the standard preparations used for com- 
parison were made to contain comparable amounts of control extract. The usual 
type of parallel line assays employing mortality of houseflies were employed for 
determining the residues in the 1-day, 3-day, and 7-day samples, but the extracts 
for the later samples had to be fortified with known amounts of the Standard in 
order to raise the response from a very low level to a level at which a parallel line 
assay could again be employed. 

To increase the sensitivity of the technique it was found that the ratio of the 
volume containing the fortifying Standard to the volume of the Test extract should 
be made as small as possible. Further, the final volume of Test extract should be 
small relative to the sample mass being extracted in order to increase the sensitivity 
expressed in such units as parts per million of sprayed alfalfa. 


742 A. G. MATHEWS (Dept. of Health, Victoria, Australia). The Interpretation 
of Antibiotic Blood Level Curves. 


Although hundreds of investigations of antibiotic blood levels have been reported, 
in comparatively few has any examination of the statistical significance of observed 
differences been performed, and in even fewer has any attempt been made to assess 
the pharmacological significance of the results. Consequently, the literature is 
filled with conflicting reports as to the relative efficacy of various dosage forms, 
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and the lack of adherence to any common experimental design makes the resolution 
of such conflicts very difficult. 

The problem of deriving a single statistic, with which to summarize the infor- 
mation contained in a set of blood level curves, is rendered difficult through ignor- 
ance of the relative importance of the height of the peak, the time at which the 
peak occurs, and the period during which the curve is above an arbitrarily chosen 
level. Attempts which have been made to derive such statistics will be reviewed, 
and their limitations discussed. 

The shape of the blood level curve in the region of the peak is usually difficult 
to determine, because of practical objections to the frequent collection of samples 
of blood. Advantages of experimental designs, in which only a few samples are 
required from each of many subjects, will be discussed. 

No matter how carefully sets of blood level curves may be determined, nor 
how ingeniously their properties may be summarized statistically, the problem 
of pharmacological interpretation remains. Stress will be laid on the importance 
of ancillary studies: for example, of the rate of excretion of the antibiotic, and 
of its distribution in the tissues of the body. 


W. Z. BILLEWICZ (University of Aberdeen, Scotland). A Statistical In- 
743 vestigation of Factors Affecting the Accuracy of D.O Determinations by the 
Falling Drop Method. 


After a short description of the method the results of the analysis of the trial 
series are briefly discussed and taken as the basis of subsequent experiments. The 
experimental results cover various aspects of the method. The effects of changes 
in room temperature, position of the dropping tube and dropping technique are 
considered. Errors of the distillation stage are considered and discounted. The 
problem of the optimal selection of tube and drop size is considered in some detail 
in terms of an index of sensitivity of the method. Errors resulting from observer 
timing as opposed to photo-electric timing are discussed. A comparison of two 
main estimation methods is made and finally the problem of serial estimates on the 
same subject is considered. 


J. DUFRENOY (Conservatoire National des Arts et Métiers, Paris, France). 
Using the Arc Tangent Scale for Quantitative Methods. 


The arc tangent scale, used either for the abscissa or the ordinate scale, or for 
both, is most convenient for plotting ‘responses’ as the dependant variable, in 
the range 0 to 100, against the independant variable, in the range 0 to infinity. 

The arc tangent is most eminently suited for transformation of the time (¢) 
into a metameter (r). 
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The following Officers have been elected for 1961: 


President: J. A. Fraser Roberts 
Secretary: C. D. Kemp 
Treasurer: P. A. Young. 


The Summer Meeting of the Region was held on July 6, 1960, at the National 
Vegetable Research Station, Wellesbourne, Warwick. 

A meeting on October 27, 1960, was concerned with ‘‘Applications of Electronic 
Computers to Biological Problems.’’ The following papers were read and discussed: 


W. T. Williams—Some Applications of Electronic Computers in Plant Ecology 
and Taxonomy. 
J. N. R. Jeffers—Data Processing in Biological Research. 
¥. Yates—Problems Arising in the Use of Electronic Computers in Statistical 
Analysis. 


Following the Annual General Meeting on December 13, 1960, the following 
papers were read and discussed: 


C. W. Emmens—The Planning and Analysis of Some Field Trials with Cattle. 
C. C. Spicer—Problems in the Analysis of a Large Scale Clinicial Trial. 


On January 2, 1961, a special meeting was held on “Biometrical Aspects of 
Plant Growth” in collaboration with the Society for Experimental Biology. The 
programme included: 


J. A. Nelder—Models and Experiments for Growth Analysis 


8. C. Pearce and C. 8. Moore—A Study of the Sources of Variation in Growth 
of Fruit Trees. 


M. J. R. Healy—Experiments for Comparing Growth Curves. 
A general discussion was led by F. L. Milthorpe. 


E.N.A.R. 


The following officers have been elected for 1961 by the Eastern North American 
Region: 


President, 1961: Oscar Kempthorne 

President-elect, 1961: Henry L. Lucas 

Secretary, 1961: Erwin L. LeClerg 

Treasurer, 1961: Donald A. Gardiner 

Regional Committee, 1961-63: Virgil L. Anderson and Robert J. Monroe. 
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ENAR TREASURER’S REPORT FOR 1960 


Marvin A. Kastenbaum, Secretary-Treasurer 


INCOME 


Balance Forward 
Checkbook balance Dee. 31, 1959 
Dues payments 


Share of Proceeds from Washington meeting (1959) 


Credit on Canadian checks 
Payment from WNAR for Programs 
Sustaining member credit 

Payment from AIBS for printing 


EXPENSES 


Checks not honored 
Refund on dues 
Treasurer International 
Transfer to Savings Account 
ENAR expenses 

Postage 

Printing 

Clerical 


Checkbook Balance December 31, 1960 


OPERATING ANALYSIS 


Operating income 
Dues payments 
Other societies 
Bank credit on checks 
Sustaining member credit 


Operating expense 
Surplus for 1960 


CHECKBOOK BALANCE 


Advanced from 1959 
Surplus for 1960 


Checkbook balance November 15, 1960 — 


SAVINGS ACCOUNT BALANCE 


Advanced from 1959 
Interest 1959 


Proceeds from Washington meeting (1959) 


Total cash in Bank December 31, 1960 


$187 .93 
365.78 
30.96 


$702 .00 
19.00 
40 
100.00 


324.29 
236.73 


716.11 
6.26 
267.88 


ile Mh 


$ 561 


990. 


.29 
0) 


07 


.02 
.07 


.02 


25 


27 


$1,551. 
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W.N.A.R. 
MINUTES OF THE WNAR BUSINESS MEETING 


The annual meeting of the Western North American Region of the Biometric 
Society was held at 1:00 P.M., August 24, 1960, on the Stanford University Campus. 
William Taylor, President, WNAR, presided. The highlights of the minutes of the 
previous meeting were read by the Secretary, Walter Becker, and were approved. 
The Secretary then gave his report. A method for obtaining News and Announce- 
ment for Biometrics was described. This consisted of sending to everyone who 
changed his address an inquiry as to his position, ete. This system, suggested by 
the ENAR Secretary has produced excellent results. Sometimes ballots are sent to 
student members inadvertently and a reminder was made that they are not entitled 
to vote. Four general mailings to the members of WNAR have been made this year 
The postage cost is about $6 a mailing, but this figure may go up as a result of 
the Secretary’s office being moved to Washington State University, Pullman, 
Washington. 

The Treasurer, Bernice Brown, gave her report. WNAR now has 164 members, 
8 of them students, an overall gain of 14 from last year. As of August 20, 1960, 
there was $445.25 on hand. The operating expenses balanced out the income for 
this year. It was brought out that much of the money in the treasury was accumu- 
lated during past administrations when much of the cost of printing and mailing 
was not born by the Society. WNAR receives $1 for every member, $10 for each 
sustaining member, and nothing from student members. The Treasurer stated 
that perhaps a working capital of $200 would be sufficient in the treasury, declar- 
ing $200 to be surplus. 

The Nominating Committee made its report. Two positions on the Regional 
Committee are now open because of the expiration of the terms of Douglas Chapman 
and Donald Wohlschlag. The names submitted by the Nominating Committee 
were Donald Owen, New Mexico; Lincoln Moses, California; Marion Sandomire, 
California; and Douglas Chapman, Washington. No nominations were made from 
the floor. The vote will be taken by a mail ballot. 

No action was reported by the Relationship of ASA and the Biometric Society 
Committee during the past year by Marion Sandomire. The By-laws of the region 
are being revised by William Taylor and will be submitted soon to the Regional 
Committee. The possibility of having a president-elect was discussed. At present 
the President is elected for two years, in order that he may be more experienced in 
the position. Several alternatives were proposed: 


1. The presidency be made a one-year position. Then we would have a presi- 
dent-elect every year. The president-elect would act as a program chair- 
man and would in addition receive copies of important letters from the 
officers and the Regional Committee. 


2. On alternate years, a president-elect would be elected. 
3. Leave as is. 


It was pointed out that the need for a president-elect might become greater 
as the membership increased. It was suggested that this matter be considered by 
the Regional Committee and be submitted to the membership in a mail ballot. 

Douglas Chapman moved and Calvin Zippin seconded the following motion: 
The By-laws may be amended by a two-thirds vote of members present at any 
annual meeting. Changes in the By-laws may be initiated by the Regional Com- 
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mittee and the By-laws amended by a majority of members voting in a mail ballot. 
This motion was passed by a unanimous vote. 


Suggestions for the use of the surplus fund (about $200) in the treasury were: 


1, Give prizes for the best student paper presented at an annual meeting (it 

was mentioned that this was tried a couple of years ago, but that students 

did not present any papers). 

Do not use the money. 

. Send bound volumes of Biometrics to foreign universities. 

. Pay students’ dues. 

. Pay the expenses of a speaker at the annual meeting. 

. The funds should be used to encourage interest in the society. 

. They could be used to promote membership (it was pointed out that new 
members’ dues would increase the treasury’s surplus). 

8. Prizes could be given to the member bringing in the most new members: 

9. A year’s subscription could be given to outstanding students. 


MIO oh LY 


No agreement could be reached, so the Regional Committee will debate the 
issue by mail. 

Two possible meeting places for the 1961 Annual Meeting were discussed. 
These were the meeting of the Pacific Division A.A.A.S. at the University of Cali- 
fornia, Davis, June 12-17, 1961 and the IMS Meeting at the University of Washing- 
ton, Seattle, in August (later information indicates that this may be in June). A 
majority voted for the Seattle meeting. 

The purpose of the program at the Annual Meeting was discussed. Should the 
program aim at a specific purpose, or should the papers be an expository review 
aimed at applied statistics? There was very little comment. 

William Taylor announced that he would be attending a meeting in the East 
with representatives of other societies to consider the possibility of a federation of 
statistical societies, besides other purposes. 

The meeting adjourned at 2:10 P.M. 

Those present at the meeting were Abramson, Becker, Bennett, Brown, Chap- 
man, Hopkins, Massey, Mode, Nash, Nicholson, Russell, Sandomire, Taylor, 
Vaughan, and Zippen. 


NOTE ON ELECTION 


Douglas G. Chapman and Marion M. Sandomire were elected to the Regional 
Committee for 1961-1963. 


CHANGES IN MEMBERSHIP 
(October 1, 1960-January 15, 1961) 


Changes of Address 


Miss Margaret F. Allen, Department of Biometrics, School of Aviation Medicine, 

_ USAF, Brooks Air Force Base, Texas, U.S. A. 

Dr. David W. Alling, Institute Allergy and Infectious Disease, National Institutes 
of Health, Bethesda, Maryland, U.S. A. 

Professor Maurice 8. Bartlett, Department of Statistics, University College, London 
W. C. 1, England. ‘ 

Prof. Dr, Eduard Batschelet, 2500 Q Street N.W., Washington, D. C., U.S. A. 

Mr. Rainald K. Bauer, Kolner Str. 228a, Dusseldorf, Germany. 
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Dr. Gilbert W. Beebe, National Academy of Sciences, 2101 Constitution Avenue, 
N.W., Washington 25, D. C., U.S. A. 

Dr. G. E. P. Box, Mathematics Research Center, U. S. Army, University of Wis- 
consin, Madison 6, Wisconsin, U.S. A. 

Dr. David Bruce, Box 4059, Portland, Oregon, U. S. A. 

Dr. Joseph G. Bryan, 11 Newport Avenue, W. Hartford, Connecticut, U.S. A. 

Mr. Walter E. Cole, Division of Forest Insect Research, U. S. Forest Service, Ogden, 
Utah, U.S. A. 

Mr. Jerry Cornfield, National Institutes of Health, Bethesda, Maryland, U.S. A. 

Dr. Paul M. Denson, Deputy Commissioner of Health, 125 Worth Street, New 
WAHT NBY ING Mey Wee AG 

Prof. Benjamin Epstein, 768 Garland Drive, Palo Alto, California, U.S. A. 

Dr. Robert Flamant, 11, rue Bouille, Fontenay—Aux—Roses (Seine) France. 

Dr. William A. Glenn, 906 King Street, Cary, North Carolina, U.S. A. 

Prof. John Gurland, U. S. Army Research Center, University of Wisconsin, Madison, 
Wisconsin, U.S. A. 

Dr. Thomas J. Haley, 3401 Woodcliff Road, Sherman Oaks, California, U. S. A. 

Dr. Sudako Hayase, 1038 Fifth Street, Santa Monica, California, U. 8. A. 

Prof. Dr. Ottokar Heinisch, Grunderstr. 28, Berlin—Grunau, Germany. 

Dr. Henry Hopp, Agricultural Attache—Bogota, c/o U. S. Department of State, 
Washington 25, D. C., U. 8. A. 

Professor Gwilym Jenkins, Department of Mathematics, Imperial College, London, 
S.W. 7, England. 

Dr. S. Karatas, Department of Animal Husbandry, University of Ataturk, Erzurum, 
Turkey. 

Dr. Eileen B. Karsh, Department of Psychology, University of Pennsylvania, 
Philadelphia, Pennsylvania. 

Mr. Charles Kiss, La Menitre, (Maine-et-Loire) France. 

Mr. Gary F. Krause, 1110 Kentwood Drive, Blacksburg, Virginia, U. 8. A. 

Mr. William T. Lewish, 2109 Wildwood Drive, Woodland Park, Wilmington 5, 
Delaware, U.S. A. 

Mr. Donald G. MacEachern 608 Third Avenue, S. E., Minneapolis 14, Minnesota, 
U.S. A. 

Mr. John W. Mayne, Van Zaeckstraat 49, The Hague, Netherlands. 

Mr. Jean Mothes, 13, boulevard des Invalides, Paris 7e, France. 

Mr. Clifford A. Myers, 2069 N. Navajo, Flagstaff, Arizona, U.S. A. 

Dr. Wesley L. Nicholson, 1215 E. 3rd, Moscow, Idaho, U.S. A. 

Dr. Fernando Orozco-Pinan, Explotaction “El Encin’”’ Alcala de Henares (Madrid) 
Spain. ‘ 

Dr. Laurence M. Potter, Department of Poultry Husbandry, Virginia Polytechnic 
Institute, Blacksburg, Virginia, U.S. A. 

Dr. D. R. Read, Cadbury Brothers Limited, Bournville, Birmingham, England. 

Mr. A. H. L. Rotti, Av. Emile Verhaeren 70, Gentbrugge, Belgium. 

Mile. Claude Rouquette, 3, rue Saint-Charles, Paris 15, France. 

M. Jean Soule, 30 avenue Saint-Laurent c. 34 (Orsay) Seine-et-Oise, France. 

Mr. John J. Sowinski, Allstate Insurance Company, 7447 Skokie, Blvd., Skokie, 
Illinois, U. S. A. 

Prof. David F. Votaw, Jr., 6 Fairlane Terrace, Winchester, Massachusetts, U. S. A. 

Mr. Jerry Warren, Department of Horticulture, University of Nebraska, Lincoln 3, 
Nebraska, U. S. A. 
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Mr. Francis R. Watson, 1415 Winding Lane, Champaign, Illinois, U. 8. A. 

Mrs. Sandra S. White, 535 E. 72nd Street, New York 21, N. Y., U.S. A. 

Mr. Amador D. Yniguez, Institute of Statistics, P. O. Box 5457, Raleigh, North 
Carolina, U. S. A. 

Dr. Sydney 8. Y. Young, 68 Barker Road, Strathfield, N.S.W., Australia. 


New Members 
At Large 


Mr. Pedro Eliz M. Revello, Instituto Venezolano Petroquim, Centro Simon Bolivar, 
Caracas, Venezuela. 


Brazil 


Mr. Joassy de Paula Neves Jorge, Instituto Agronomico Caixa Postal 28, Campinas 
S.P., Brazil. 


France 
M. Pierre Lossois, 120 bis avenue de Verdun, Issy-les-Moulineaux (Seine) France. 


Germany 


Dipl. G. Enderlein, Karl-Marx-Str. 31, Klein Wenzleben bei, Magdeburg/DDR, 
Germany. 

Mr. Werner Haufe, Sophienstr. 21, Einbeck/Hann., Germany. 

Prof. Dr. A. Maede, Eichenweg 5, Halle/Saale, Germany. 

Mr. Dieter Rasch, Fritz Reiter Str. 22, Rostock, Germany. 

Dr. Ernst Weber, Windhalmweg 36, Stuttgart-Plieninger, Germany. 

Doz. Dr. Hermann Witting, Dreikonigstrasse 9, Freiburg i. Br., Germany. 


Italy 


Dott. Giuseppe Agnese, Istituto di Igiene dell’ Universita, via Pastore 1, Genova, 
Italy. 

Dr. Giancarlo Chisci, Stazione Sperimentale di Praticoltura, Viale Piacenza 25, Lodi, 
Milano, Italy. 

Dr. Maurizio Turri, Viale O. P. Vigliano 17, Milano, Italy. 


Western North American Region 


Mr. Peter A. Dawson, Department of Genetics, University of California, Berkeley 4, 
California, U. S. A. 

Dr. Olive Jean Dunn, School of Public Health, University of California, Los Angeles 
24, California, U. S. A. 

Prof. Robert F. Fagot, Department of Psychology, University of Oregon, Eugene, 
Oregon, U.S. A. 

Dr. Robert Macey, Department of Physiology, University of California, Berkeley 4, 
California, U. S. A. 

Mr. Prem Singh Puri, 1845 Hearst Avenue, Berkeley 3, California, U. S. A. 

Mr. Kurt Sittman, Animal Husbandry Department, University of California, Davis, 
California, U. S. A. 

Mr. Albert R. Stage, 157 South Howard Street, Spokane 4, Washington, U.S. A. 


NEWS AND ANNOUNCEMENTS 


Members are invited to transmit to their National or Regional Secretary (if 
members at large, to the General Secretary) news of appointments, distinctions, or 
retirements, and announcements of professional interest. 


EDITORIAL NOTE 


The general objective of Biometrics is to promote and to extend the use of 
mathematical and statistical methods in pure and applied biological sciences. This 
objective has been broadly interpreted in editorial decisions and this is proper. 
However, it is believed also that mathematicians and statisticians contributing to 
Biometrics have an obligation to present their work so that the natures of the prob- 
lems in the biological sciences to which the methods apply are clearly understood 
by those who may profit by the use of the methods. Similarly, biologists should 
explain their problems so that their needs and methods are clearly understood by 
the mathematician and statistician. While these requirements may not be easy to 
meet, direct consideration of them should greatly increase the value of Biometrics 
to all members of the Biometric Society. Careful attention to introductions of 
papers, including formulation of problems and models, with skillful selection of 
examples would be helpful. When possible, algebraic and mathematical discussions 
should be placed in appendices to papers in order that they do not detract from the 
emphasis on the problem areas and the readability of papers. 


MEETINGS OF E. N. A. R. 


The Eastern North American Region will meet jointly with the Institute of 
Mathematical Statistics and the American Statistical Association on April 20, 21, 
and 22, 1961, at Cornell University. Titles and abstracts, the latter in duplicate in 
the form published in Biometrics, of contributed papers for ENAR should be sent 
to Dr. Erwin L. LeClerg, Biometrical Services, Plant Industry Station, Belts- 
ville, Maryland. 

In 1961, ENAR will also meet jointly with the American Institute for Biological 
Sciences at Purdue University and with the American Statistical Association in 


New York City. 
MEETING OF W.N.A.R. 
The Western North American Region will meet jointly with the Institute of 
Mathematical Statistics in June, 1961, at the University of Washington. 
UNIVERSITY ANNOUNCEMENTS 
THE CATHOLIC UNIVERSITY OF AMERICA 


The Statistical Laboratory of the Catholic University of America has been 
awarded a grant by the National Institutes of Health for training in the field of 
Biometry. 

The stipends for first-year graduate students are $2250 plus tuition; family 
allowances for dependents and annual increases are provided. 
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The students will pursue the same general program as other students in mathe- 
matical Statistics. They will participate in the consulting activities of the laboratory 
and will be required to attend some courses in the biological sciences or other fields 
relevant to the study of biometry. 

In addition to the grants in biometry, there are also fellowships under the 
National Defense Education Act available. Some appointments to graduate as- 
sistantships and research assistantships will also be made. 

The Statistical Laboratory of the Catholic University of America is also expand- 
ing its activities into the areas of biomathematics and biometry. A training program 
and consulting service are being organized. Professor Edward Batschelet (on leave 
from the University of Basel, Switzerland) was appointed Visiting Professor in 
this program for the academic year 1960-61. Professor Harold Bergstrom of the 
Institute of Applied Mathematics of Chalmers Institutes of Technology (Goteborg, 
Sweden) was appointed Visiting Professor for the academic year 1960-61. He will 
primarily be engaged in research in probability theory. Professor D. Dugue of the 
Sorbonne (Paris, France) and Dozent T. E. Dalenius of Stockholm University are 
expected to visit Catholic University during the spring term 1961. 

Requests for further information and application forms should be addressed 
to Professor Eugene Lukacs, Director, Statistical Laboratory, The Catholic Uni- 
versity of America, Washington 17, D. C. 


IOWA STATE UNIVERSITY 


The National Science Foundation will sponsor a Summer Institute for College 
Teachers of Statistics at Iowa State University for the 11-week period from June 5 
through August 18, 1961. The Departments of Statistics of three other universities, 
Kansas State, Utah State and the University of Wyoming, are cooperating with 
Iowa State’s statistical center in presenting this institute. 

Financial support in the form of stipends, dependency allowances and travel 
allowances will be awarded to 50 eligible applicants. All American college and 
university teachers who are, or who during the 1961-62 academic year will be, re- 
quired to teach one or more courses in statistics as part of their regular assignments 
are eligible for consideration. 

The institute is planned to provide additional basic training in statistics for 
present and prospective teachers who, though well-grounded in other fields, have 
limited backgrounds in statistics. Also it will provide more advanced courses and 
seminars designed to keep college and university teachers abreast of new develop- 
ments. 

Courses are scheduled in Statistical Methods, Theory of Statistics, Experi- 
mental Design, Survey Designs, Topics in Foundations of Probability and Statistics, 
and Intermediate Applied Decision Theory. In addition, an opportunity will be 
provided for those interested to observe a demonstration class in Principles of Sta- 
tistics at the undergraduate level. The faculty will include the institute director, 

Dr. T. A. Bancroft, Director of the Iowa State University Statistical Laboratory 
“and Head, Department of Statistics; Dr. R. J. Buehler, Associate Professor of 
Statistics, Iowa State University; Dr. H. T. David, Associate Professor of Statistics, 
Iowa State University; Dr. H. C. Fryer, Head of the Department of Statistics and 
Statistical Laboratory Director, Kansas State University; Dr. H. O. Hartley, Pro- 
fessor of Statistics, Iowa State University; the Institute Associate Director, Dr. 
D. V. Huntsberger, Associate Professor of Statistics, Iowa State University; and 
Dr. R. L. Hurst, Head of the Department of Applied Statistics and Statistical 
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Laboratory Director, Utah State University. Guest lecturers will present a series 
of special seminars. 

Requests for information or application forms should be addressed to: The 
Director, Summer Institute in Statistics, 102 Service Building, lowa State Uni- 
versity, Ames, lowa. 

The Department of Statistics at Iowa State University will offer also eight 
applied courses in statistical theory and methods in its two 1961 summer sessions. 
These courses are planned primarily for graduate students or research workers with 
limited mathematical backgrounds who wish to use statistical techniques intelli- 
gently for application to other fields. In addition, a course on special topics in 
theoretical or applied statistics may be studied at the graduate level. Senior staff 
members will be available during most of the summer for consultations on research 
or special problems. 

Students may register for either or both of the six-week summer sessions: June 
5-July 12 and July 12-August 18. The complete list of statistics offerings for the 
first session is as follows: Stat. 401, ‘Statistical Methods for Research Workers 
“(at the level of Snedecor’s Statistical Methods); Stat. 447, “Statistical Theory for 
Research Workers” (mainly theory of experimental statistics at the level of Ander- 
son and Bancroft’s ‘Statistical Theory in Research’’; Stat. 411, “Experimetnal De- 
signs for Research Workers,” Stat. 599, “Special Topics;”’, Stat. 599A1, “Topics in 
Foundations of Probability and Statistics; and Stat. 699, “Research.” In the 
second session will be offered Stat. 402, a continuation of 401, Stat. 448, a continu- 
ation of 447; Stat. 421, “Survey Designs for Research Workers;” Stat. 599, Stat. 
599A2, “Intermediate Applied Decision Theory (at the level of Blackwell and 
Girshick, Theory of Games and Statistical Decisions), and Stat. 699. 


JOHNS HOPKINS UNIVERSITY 


Beginning next September, the Department of Biostatistics at The Johns 
Hopkins University will offer an expanded program of study and research leading 
to Master of Science and Doctor of Science degrees. The curriculum has been modi- 
fied by increasing the scope of the basic courses in statistical theory and statistical 
methods and by the addition of specialized courses including least squares and 
regression, stochastic processes, nonparametric methods, sampling and survey 
methods, biological assay, design of experiments and digital computer programming. 
These changes reflect a realization of the need for more intensively trained statis- 
ticians in the areas of biology, medicine and public health, and they are the direct 
result of the increasingly important role played by mathematics and statistics in 
all areas of scientific research. 

The new program was made possible in part by the recent addition to the de- 
partment of Drs. Allyn W. Kimball and David B. Duncan who together with Drs. 
Helen Abbey, Earl Diamond, John J. Gart and Margaret Martin form the perma- 
nent staff. In September, 1961, Dr. Norman T. J. Bailey of Oxford University will 
join the staff as visiting professor and will teach the course in stochastic processes. 
Additional appointments may be announced shortly. 

The department has a limited number of liberal fellowships available, and 
interested students are invited to write to the Chairman, Department of Biosta- 
tistics, 615 North Wolfe Street, Baltimore 5, Md. for further information. 


PURDUE UNIVERSITY 


There will be three intensive courses in the general areas of statistical methods 
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and quality control, design of experiments, and operations research at Purdue Uni- 
versity this summer. 

; The course in Statistical Methods and Advanced Quality Control has been 
given at Purdue annually since 1947, being most recently revised in 1960. This 
course is designed for those who have had the equivalent of one of the intensive 
courses in statistical quality control given during and after the war, and who want 
to learn more about the statistical approach to industrial and research problems. 

Topics to be studied during the ten-day quality control course are Significance 
Tests and Confidence Intervals, Significance of Differences, Linear Correlation and 
Regression, Single Sampling for Measurements, Sequential Sampling for Measure- 
ments, Multiple Correlation, and Analysis of Variance. Instructors include Profs. 
Irving W. Burr, the course director, and Charles R. Hicks, both of Purdue, Cecil 
C. Craig of the University of Michigan and Gayle McElrath of the University of 
Minnesota. The dates for this course are September 5-15. 

The second course is on Design of Experiments and is for statisticians, quality 
control personnel, engineers, and others concerned with planning, analyzing, and 
interpreting the results of industrial experiments. The dates for this course are 
June 7-17. This will be the third year this course has been offered. 

Professor Hicks, of the mathematical and statistical staff, the course director, 
emphasizes that this advanced course is for persons who have had previous statisti- 
cal training, including work on tests of hypotheses, linear correlation, and at least an 
introduction to analysis of variance. 

Topics included in the short course are Review of Analysis of Variance, Princi- 
ples of Experimental Design, Variance Component Analysis, Randomized Blocks 
and Latin Squares, Factorial Experiments, Split Plot Designs, Confounding in 
Factorial Experiments, Incomplete Block Designs, Fractional Replications, and 
Introduction to Evolutionary Operations (EVOP). 

In addition to Professor Hicks, instructors will include Prof, Clyde Y. Kramer, 
Virginia Polytechnic Institute, and Professor McElrath. Enrollment will be limited 
to a maximum of 25-30. 

The third ten-day short course is on The Mathematical Techniques of Oper 
ations Research which will be offered at Purdue University for the second time this 
summer on June 5-15. 

The course has been designed for statisticians, quality control analysis, engi- 
neers, and other technical personnel in industrial and management positions. Empha- 
sis will be placed on the mathematical techniques of operations research and the 
application of these methods to current industrial and military problems. 

These methods involve the construction of mathematical models representing 
the operation of industrial management or a military organization, and suggest 
the best solutions to problems involved. Among the topics to be discussed during 
the course are inventory control models, waiting line models, linear programming, 
simplex method, transportation methods, production scheduling models, search 
theory, cost-effectiveness studies, and systems analysis. 

Instructors for the short course are Prof. Paul Randolph of Purdue, who is the 
short course director, Albert Madansky, of the RAND Corporation, and Prof. 
Bernard Lindgren, of the University of Minnesota. Enrollment in this course will 
be limited. 

All three courses are sponsored jointly by the Statistical Laboratory and the 
Division of Adult Education at Purdue. Further information about any of the 
courses may be obtained by writing to the Division of Adult Education, Purdue 
University, Lafayette, Indiana. 
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SOUTHERN METHODIST UNIVERSITY 


The National Science Foundation has awarded Southern Methodist University 
a grant of $26,400 to conduct a basic study of the feasibility of making the new 
Science Information Center at SMU a regional scientific and technical information 
center serving the needs of Southwestern industry and higher education. 

Dr. William J. Graff, Jr., chairman of the SMU Department of Mechanical 
Engineering, is principal investigator for the study. He is assisted by Sam G. 
Whitten, SMU Science Librarian. 

The building to house the new Science Information Center, a gift of local 
industrialists, is under construction and is scheduled for completion next August. 
A million dollar structure consisting of 80,000 sq. ft. on four floors, it will have 
space for half a million books. It will serve the SMU faculty and student body as 
well as the new Graduate Research Center. Space is provided in the building for 
administrative offices of the Graduate Research Center, whose director is Dr. Lloyd 
Berkner; for the SMU Map Library and Herbarium; and for the DeGolyer Geology 
Collection, in addition to the general document collections in science, mathematics, 
and engineering. 

The study sponsored by the National Science Foundation is scheduled for com- 
pletion in May, 1961. It will be based to a large extent on personal interviews and 
questionnaires aimed at the scientists and technicians who are engaged in research 
in the Southwest. An attempt will be made to find out what kind of information the 
research people need and in what subjects. 

Also involved in the study are visits to and detailed studies of existing scientific 
information centers and an economic study of the area done by business economists. 

The investigators expect to reach many research workers by questionnaires 
sent through local professional and scientific and technical societies, and will ask 
for employers aid in setting up personal interviews with research workers. 


UNIVERSITY OF MINNESOTA 


A special summer program of statistics in the Health Sciences will be held 
from June 13 to July 28, 1961 at the University of Minnesota. Courses in diverse 
areas of statistics at elementary and advanced levels will be given. Experts from 
around the nation will be teaching. Stipends are available to qualified students. 
For further information write to Biostatistics, 1226 Mayo, University of Minnesota, 
Minneapolis 14, Minnesota. 


VIRGINIA POLYTECHNIC INSTITUTE 


The 1961 session of the Southern Regional Graduate Summer Session in Sta- 
tistics will be held at the Virginia Polytechnic Institute, Blacksburg, Virginia, from 
June 15 to July 22, 1961. 

The Virginia Polytechnic Institute, Oklahoma State University, North Caro- 
lina State College, and the University of Florida have agreed to operate a continuing 
program of graduate summer sessions in statistics to be held at each institution in 
rotation. The program was instituted at Virginia Polytechnic Institute in the Sum- 
mer of 1954. 

The 1961 session, like previous sessions under this program, is intended to 
serve: 1) teachers of statistics and mathematics; 2) professional workers who want 
formal training in modern statistics; 3) research and engineering personnel who 
want intensive instruction in basic statistical concepts and modern statistical meth- 
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odology; 4) Public Health statisticians who wish to keep informed about advanced 
specialized theory and methods; 5) prospective candidates for graduate degrees in 
statistics; and 6) graduate students in other fields who desire supporting work in 
statistics. 

The session will last six weeks and courses will carry five quarter hours of credit. 
Not more than two courses may be taken for credit at any one session. The summer 
work in statistics may be applied as residence credit at any of the cooperating insti- 
tutions, as well as certain other universities, in partial fulfillment of the require- 
ments for a graduate degree. The program may be entered at any session, and 
consecutive courses will follow in successive summers so that it would be possible 
for a student to complete the course work in statistics for a Master’s degree in three 
summers. Students must satisfy the remaining requirements for course work 
and thesis at the institution where they are to be admitted to candidacy. The 
advanced courses may be accepted as part of the Ph.D. program of the participating 
institutions. 

A limited number of fellowships will be available for applicants from certain 
specialized areas. Doctoral courtesy will be honored for those holding Ph.D. or 
M.D. degrees. 

The courses to be offered in statistics in 1961 at the Virginia Polytechnic Insti- 
tute are as follows: Statistical Methods, Sampling Theory; Applied Statistics for 
Engineers and Physical Sciences; Theory I, Probability; Theory II, Statistical 
Inference; Theory III, Theory of Least Squares; Principles and Practices of High 
Speed Computing (enrollment limited to 14); Non-parametric Methods; and Multi- 
variate Methods. 

A number of courses in advanced mathematics will be available during the 
Summer Session. For a complete listing please consult the University timetable. 
A series of Colloquia involving recent developments in statistical theory and meth- 
ods will be conducted during the special Summer Session. 

Requests for application blanks for the summer session and for fellowships 
should be addressed to Dr. Boyd Harshbarger, Head, Department of Statistics, 
Virginia Polytechnic Institute, Blacksburg, Virginia. 


CONTINENTAL CLASSROOM 


Probability and Statistics, a nationally televised college-credit mathematics 
course, will be offered on the second semester of Continental Classroom beginning 
January 30, 1961. 

Presented by Learning Resources Institute in cooperation with the Conference 
Board of the Mathematical Sciences, and telecast in color and black-and-white 
by the National Broadcasting Company, the course will be taught by Professor 
Frederick Mosteller, Chairman, Department of Statistics, Harvard University, and 
Professor Paul C. Clifford, Professor of Mathematics, Montclair (N. J.) State College. 

More than 300 colleges and universities are expected to offer the televised 
course for college credit. It will be carried by 170 stations throughout the nation, 
from 6:30 to 7 a.m. Monday through Friday in each time zone. 


NEW JOURNALS 


Journal of the Forensic Science Society 


Stuart S. Kind is the editor of a new journal, The Journal of the Forensic Science 
Society with editorial office c/o Rossett Holt, Pennal Ash Road, Harrogate, York- 
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shire, England. Papers on biometry, especially in relation to human individuality, 
classification, and identification systems, will be welcomed. 


Crop Science 


Crop Science is the name of the new research journal which will appear in 
February 1961 as the official publication of the Crop Science Society of America, 
an affiliate of the American Society of Agronomy. 

This new, bimonthly journal will carry research reports on breeding, genetics, 
physiology, ecology, and management of field crops, pastures, ranges, and turf- 
grasses from crop scientists in the U. S., Canada, and other countries. 

Crop Science will be a companion publication to Agronomy J ournal, the official 
organ of the ASA. It will alternate in publication dates with the Journal and will 
carry the articles (formerly publishable in the J: ournal) of direct interest to workers 
in the above-mentioned areas of research. Agronomy Journal will continue to carry 
the articles of wider agronomic scope—of interest to both crop and soil scientists, 
seed and weed technologists, plant pathologists, agricultural meteorologists, and 
others. 

Publication of research reports in Crop Science will be open to members of the 
Crop Science Society—with joint membership in the American Society of Agronomy. 
Detailed information on publication in or subscription to Crop Science or Agronomy 
Journal or both may be obtained from the American Society of Agronomy, 2702 
Monroe Street, Madison 5, Wisconsin. 


TRAVEL GRANTS FOR ATTENDANCE AT THE INTERNATIONAL 
CONGRESS OF MATHEMATICIANS 


Travel grants will be made to a number of mathematicians who wish to attend 
the International Congress of Mathematicians in Stockholm, on August 15-22, 
1962. It is hoped that funds available through various sources may provide travel 
assistance for a considerable number of mathematicians. 

There will be a greater effort than in the past to give aid to younger people. 
As grants will be made only to those who have filed applications, it is urgent that 
any who wish to receive a grant should fill out and file an application. Younger 
people are urged to file applications so that their cases can be considered. Appli- 
cations can be obtained from the Division of Mathematics, National Academy of 
Sciences, National Research Council, Washington 25, D. C. by requesting an appli- 
cation for a travel grant to the 1962 International Congress. 

The deadline for filing of applications is November 1, 1961, and an attempt 
will be made to announce the grants by January 1, 1962. 

Awarding of grants will be made only to those persons whose applications bave 
been received, in good order, by November 1. The selection will be made by a com- 
mittee consisting of the regular Committee on Travel Grants of the Division of 
Mathematics of the National Academy of Sciences—National Research Council 
enlarged to include representatives of societies affiliated with the Division and 
representatives of various governmental agencies. 


NEWS ABOUT MEMBERS 


Laurence H. Baker is presently employed by Hy-Line Poultry Farms as a geneti- 
cist in Des Moines, Iowa. 
David Bruce is Chief of Division Forest Fire Research, Pacific Northwest 
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Forest and Range Experiment Station, USDA. He formerly held the same position 
at the Southern Forest Experiment Station. 

Martha W. Dicks is presently Assistant Professor of Human Nutrition at 
Montana State College. 

Charles W. Dunnett was awarded a D.Sc. degree in statistics on completion of 
a two-year period of research at the University of Aberdeen, Scotland, under Dr. 
D. J. Finney, F.R.S. The title of his thesis was ‘The Statistical Theory of Drug 
Screening.’ He has now returned to his position as Head of the Statistical Design 
and Analysis Department at the Lederle Laboratories Division of the American 
Cyanamid Company, Pearl River, New York. 

Lincoln J. Gerende is presently Instructor in Biostatistics, Department of 
Epidemiology and Public Health, Medical School, Yale University. 

Leo A. Goodman is a Visiting Professor of Mathematical Statistics and Soci- 
ology at Columbia University during 1960-61 on leave from his position at the 
University of Chicago. 

John Gurland, on leave from Iowa State University, is presently employed at 
the Mathematics Research Center of the University of Wisconsin in Madison, Wis- 
consin. 

James A. Hagans of the Department of Preventive Medicine and Public Health 
of the University of Oklahoma has recently received his Ph.D. degree from that 
institution. 

Carl E. Hopkins is now special consultant (Air Pollution Medical Studies) to 
the School of Public Health, UCLA. 

Leo Katz has returned to his position as Head of the Department of Statistics, 
Michigan State University. He was on leave of absence serving as Scientific Liaison 
Officer for the Office of Navy Research in London, England, covering statistics and 
probability in Europe. 

George H. Kennedy is presently employed by the Department of Health, 
Education and Welfare, U. S. Public Health Service in Bethesda, Maryland. He 
was formerly at the Biological Laboratories of the Department of Defense at Fort 
Detrick, Maryland. 

John R. Kinzer has recently returned to his position of Professor of Psychology 
at Ohio State University, Columbia. He had been on a two-year leave of absence 
to work at the System Development Corporation, Santa Monica. 

Gary F. Krause has given up his post as Instructor in Statistics at Kansas 
State University to become a full time graduate student in Statistics at the Virginia 
Polytechnic Institute at Blacksburg, Virginia. 

Dr. Kuo Hua Lee has taken a position as an Associate Professor of Biostatistics 
in the University of Oregon Dental School, Portland. 

William T. Lewish has taken a position as Special Service Engineer for the E. I. 
DuPont de Nemours and Co. in Wilmington, Delaware. 

Alexander M. Mood formerly president of General Analysis Corporation be- 
came Vice-President of C-E-I-R, Inc. and Manager of the Los Angeles Research 
Center of C-E-I-R, when General Analysis Corporation merged with C-E-I-R. 

Wesley L. Nicholson is on a four-month-special assignment as Professor of 
Mathematics at the University of Idaho. His permanent post is senior statistician 
for the General Electric Company in Richland, Washington. 

L. M. Potter, formerly Research Associate in the Department of Poultry 
Science at the University of Connecticut, is now Associate Professor in the Depart- 
ment of Poultry Husbandry at the Virginia Polytechnic Institute at Blacksburg, 


Virginia. 
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Mr. J. N. K. Rao, an advanced graduate student in Statistics at Iowa State 
University, has been selected as the recipient of the George W. Snedecor Award in 
Statistics, which consists of a cash prize of $25.00, a year’s membership in the Insti- 
tute of Mathematical Statistics and a year’s subscription to the Annals of Mathe- 
matical Statistics. 

Maynard W. Shelly, II, formerly a psychologist in the Office of Naval Research 
is now a mathematician in the Logistics and Mathematical Statistics Branch of 
ONR. 

John H. Smith is a Visiting Professor at the University of Chicago during the 
year. He is on sabbatical leave from the American University. 

Jerry Warren is presently Assistant Professor in the Horticulture Department 
of the University of Nebraska. He was formerly with the Vegetable Crops Depart- , 
ment of Cornell University. 
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ON ADDITIVITY IN THE ANALYSIS OF VARIANCE! 
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INTRODUCTION 


The problem relating to additivity in the analysis of variance is 
twofold. In the first place we wish to know whether we can remove 
any of the non-additivity present in our data, and in the second place 
we wish to know, given that it can be done, how to do so. Here two 
methods that have already been proposed for testing for non-additivity 
are developed from a somewhat different viewpoint, in an attempt to 
clarify their properties; and a generalization is given of a method that 
has been suggested for finding an appropriate transformation of the data. 


TESTING FOR REMOVABLE NON-ADDITIVITY 


Definition. If u;; is the subclass mean corresponding to the 7-th 
level of A and to the j-th level of B in any 2-way classification, then 
we shall say that A and B are additive (or the »;; show additivity) 
if and only if there exist constants a, , B; such that 


Mis =O; +B; , all 4,39. 


For an n-way classification, denote the subclass means by pu, , where ¢ 
is an n-tuple of numbers designating the particular subclass; then there 
is additivity if and only if there exist constants a;, , 8;,, °** , ¥:, Such 
that 


| Oi Garey) Une Aor a Oe + Bi, 4- eas a Ving all 44 ai: ie > Un A 


Now whether or not the u, show additivity depends on the scale 
on which they are measured. In practice we measure our data on any 
scale that happens to be convenient, and from these measurements 
estimate the u, ; and, unless we transform the data, we are estimating 
the u, as measured on the same scale as that on which the data are 
measured. But, in many cases, the estimates we obtain: are of far 
greater value to us if we can assume that the u, show additivity. Thus, 
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ON ADDITIVITY IN THE ANALYSIS OF VARIANCE! 


R. C. Eston 
Department of Biostatistics, University of North Carolina 
Chapel Hill, North Carolina, U. 8. A. 


INTRODUCTION 


The problem relating to additivity in the analysis of variance is 
twofold. In the first place we wish to know whether we can remove 
any of the non-additivity present in our data, and in the second place 
we wish to know, given that it can be done, how to do so. Here two 
methods that have already been proposed for testing for non-additivity 
are developed from a somewhat different viewpoint, in an attempt to 
clarify their properties; and a generalization is given of a method that 
has been suggested for finding an appropriate transformation of the data. 


TESTING FOR REMOVABLE NON-ADDITIVITY 


Definition. If u;; is the subclass mean corresponding to the 7-th 
level of A and to the j-th level of B in any 2-way classification, then 
we shall say that A and B are additive (or the u;; show additivity) 
if and only if there exist constants a; , 8; such that 


Mij = a; 4. B; ) all 4, ip 
For an n-way classification, denote the subclass means by yu, , where q 
is an n-tuple of numbers designating the particular subclass; then there 
is additivity if and only if there exist constants a;, , 8;,, °°: , vi, Such 
that 


CH ie = Oy, oP ea ar ee all dig ids ee tees 


Now whether or not the », show additivity depends on the scale 
on which they are measured. In practice we measure our data on any 
scale that happens to be convenient, and from these measurements 
estimate the u, ; and, unless we transform the data, we are estimating 
the », as measured on the same scale as that on which the data are 
measured. But, in many cases, the estimates we obtain: are of far 
greater value to us if we can assume that the u, show additivity. Thus, 
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if we analyze a set of data ya and find we have to reject the hypothesis 
that the corresponding », show additivity, we should try and find a 
transformation of y, to %, such that the E(x,) show additivity (where E 
denotes expectation). Since the y, are measured on & continuous scale, 
it is reasonable to consider only continuous functions of the Yc ; and 
for such a transformation to be of any practical use it must be a single- 
valued function. With the additional reasonable requirement that the 
inverse function also be single-valued, this is equivalent to requiring 
the function (or its inverse) to be strictly monotonic. 

From here on the symbol yu, will be used to denote subclass means 
measured on a scale on which they show additivity, and we shall denote 
by ya the data measured on any convenient scale. Suppose there exists 
a scale such that when the data are measured on it, and we denote 
these measurements by 2; , 


E(x.) =ut un. (where yp is any constant) . (1) 


Then the E(a,) show additivity. Furthermore, there is no loss of 
generality when we assume 


max p, + min yu, = 0, max p, = 0, min pu, < 0. (2) 


(We assume uy, lies within a definite range, determined by the extent 
of our data.) 
If E(y,) isa linear function of £ (x,), then the E(y,) show additivity. 
If H(y,) is a non-linear function of E(x,), then we can approximate 
this function by a quadratic polynomial, i.e. 
Ely.) = + aE(e,) + a[E (a)! (3) 
If E(y,) is better approximated by such a quadratic polynomial than 
by any linear polynomial in H(x,), then, provided our data lie within 
the range where this quadratic polynomial is strictly monotonic, there 
exists a single-valued function such that, when applied to the data, 
the expectations of the transformed data come nearer to satisfying 
the condition for additivity. 
Assuming exact equality in (3) and using (1) we may rewrite it as 
a polynomial in yu, , say 
E(y,) =a Dug ae Cua : (4) 
Differentiating with respect to u, we obtain 6 + 2cy, , and here there 
is a maximum or minimum. Thus E(y,) will be strictly monotonic 
over the two ranges 
(i) b + 2c min p, > O 
and 
(11) b + 2c max p, < 0. 
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Now using (2) it follows that if max u, — min pu, < b/ |c |, (i) is satisfied; 
and if max nz — min w, < —b/| c |, (ii) is satisfied (and in this case 
we must have b < 0). 

It is clear from this that there is no point in considering the case 
b = 0, and so we can arbitrarily let b = 1 (i.e. divide our model through 
by b) and rewrite (4): 


Ey.) =ut+ Mg + Cue ; 


where u and ¢ are appropriately redefined; and this is a strictly mono- 
tonic function over the range 


max pw, — min p, < ar (5) 


What we wish to do, then, is to assume a model of this form with 
the y, normally and independently distributed, and test the null hy- 
pothesis that c = 0. If we reject this hypothesis, then, provided (5) 
holds, there exists a monotonic function of the data (whose inverse 
is a quadratic polynomial) that will bring us closer to additivity. In 
order to test this hypothesis we should like to find the reduction in 
sum of squares R(I) due to fitting the model under the null hypothesis, 
and the reduction, R(I1), due to fitting the full model. These reductions 
are given by 


RD) - > Y; — min MS, (Yo ay oy a ae 
and 
RID = do) yi — min DY (Ya — b— Ma — Cha) 


where the summations are over all observations, and yp, yw, and ¢ are 
estimated from the data. We can find R(I) in the usual way; but 
_ when we set up the normal equations to find R(II), we obtain a set 
of cubic equations that are not easy to solve. 

The problem can be simplified, and an approximate solution ob- 
tained, as follows. For the sake of clarity consider the 2-way classi- 
fication, one observation per subclass. Under the null hypothesis 
c = 0 we have 


(J) E(yi;) Soft ay to Smee ang 
and otherwise 
(II) Ey) =u +a; +B; tea: + 8B)’. 


Now the only extra term in the right hand side of (II) that depends 
on both 7 and j is the term 2ca;6; , and it is only if the absolute magni- 
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tude of this term is ever large that (IT) can lead to a significantly smaller 
residual sum of squares than (I). We may therefore (neglecting the 
coefficient 2) consider the approximately equivalent model, (II'), given 
by 


E(yi:) ari = a; + B; ae ca;:B; } (6) 


j.e.we assume a; = a; + co; and B; = B; + cB; , and this is reasonable 
when we remember that c must satisfy (5) for monotonicity. 

In order to obtain R(II’), the reduction in sum of squares due to 
fitting the model (6), it is necessary to minimize > >i; (Yis — HO — 
8; — ca,8;)’, which results in the following normal equations: 

ye, (GS aE Boe ¢a,8;) = De Yii (7) 
XH @+ 4 + 8; + a8) + ,) = DY yal +683) 
(a + 4: +B; + a8) + 2) = De yi + €ad) 


1 


A A 
, YisiB; 5) 


vi 


I 


y (a + & + B + 68:8) @:8:) 


where fi, &; , 8; are a set of values for 4, a; , B; that minimize the above 
sum of squares. If we let >; 0: = DB; = 0, we obtain 


ie 2X Yai» (8) 

a: = = (yi; — A — B)UL+ B)/ 2 (1 + 6)’, (9) 
Bi = 2 ar — B— ON + ca:)/ Di (L + 64)’, (10) 
é= » yeiQiBs/ Qo (@,8,)’. (11) 


We can obtain f@ directly from (8), but &; , 8, and é have to be obtained 
by iteration. We first calculate @; and 8, from (9) and (10) assuming 
é = 0, and then, using these values of @; and 8; , we calculate 1é, a 
first approximation to é, from (11). Then assuming é = ,é we recalcu- 
late &; and 8; from (9) and (10), and use these new values to obtain 
a second approximation to é from (11). The process is continued until 
all the estimates become stable. R(II’) is then given by 


S. Ves oy OR ae B; = é4,8;)° 


Cz) OF] 


=f ya + ae ieee > Bb; De tiaee Dd yiid:B; 


a a 


when >>, &; = >»; 8; = 0 and the estimates are such that this reduction 
is a maximum. 
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This method of solution was given by Ward and Dick [1952], but 
they considered the model (6) for a different reason: they wished to 
test the appropriateness of a multiplicative model of the form 


Eyi;) = (u + a)(u’ + B;) = wp! + plas + uB; + a8; , 
and this led them to assume the model (6). They state that the F-test 
for ¢ is only approximate. If we assume as our full model 
Yi; = b + @ + B; aR ca;B; == Cra 
where the ¢;; are independent and N(0, o”), then to test the null hy- 
pothesis that ¢ = 0 we could take as our statistic 
ER Sr 
| ey — RODI/Q@ — 1)? 


where there would have been n degrees of freedom associated with 
the residual sum of squares if the model did not include the term ca;8; . 
But whether or not this statistic actually does follow the F-distribution 
on 1 and (n — 1) degrees of freedom under the null hypothesis requires 
further investigation. 

Now consider the sum of squares due to ,é, the first approximation 
to é, which is given by 


ee any =e Dirge re os Ce i aa By 50a) 
aed [5 es i, a, (Yi; aa bE ra &:B;) | 


= ate 2d [,¢(@,8;)° a 24:8 iY; sae fia, P; aa a8; = a,B;)] (12) 


where fi, &; and 8; are values of a, &; and 8, satisfying (7), (9) and (10) 
. when é = 0, i.e. we have 


Ui Oa By tee, (13) 


where yz fn dea Tae Now from (11) we have ,é = 
Danae oe (a,6;)°. Using this and (13), and letting >); @ = 
>=; B; = 0, we find from (12), 


ip GiB Yes]? [x 6B sé; ‘an 
( ONES SAAR ae = ae 
eg he DGB) a (4:B;) 
From this it is seen that the sum of squares due to ,é is Just the sum of 


squares proposed by Tukey [1949a] to test for non-additivity; the 
general formula given by Tukey [1955] is easily shown to be identical 
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with the above for a two-way classification when De aS De, Bie. 
Thus in Tukey’s test for non-additivity we assume the model (II’’) 


Osi ahy +b a, 1 8 cla:Bjlig 1 esi » 
e,; independent, N(, o°), where cla:Bi]ii indicates that the term is 
part of the interaction. This is to be interpreted as follows. We let 
Yii = KB +a; +B; + (a) + 


where >_; (a8);; = 0 for all j and a (a8) ;; = 0 for all. This uniquely 
defines » + a; + 6; a8 population parameters, but not a; or B; . 
Now for any set of a; and 6; that are consistent with this definition let 


QB; = ay Pn CB) 


where >; (a8)/; = 0 for all j and >»; (a@6);; = 0 for all 7. Then this 
uniquely defines (@6)f; , and this is what is represented above by [a:Bi li; - 
The parameter c in model (Il’’) can be thought of as the regression of the 
interaction effect (#@),;; on the product of the main effects. We then 
test the null hypothesis that c = 0 using the statistic 


oe ROI’) — RD . 
(Sa kal yD 


(14) 


Now since the y,;; are normally and independently distributed with 
variance o”, R(II’’) — R(1) is distributed, for fixed &; and B; , asa x 
on 1 degree of freedom; and under the null hypothesis this is a central 
x’-distribution, for then E (z;;) = 0. Also, since the distribution is the 
same for all fixed &; and 8; , this is in fact the marginal distribution of 
R(IU’) — R() as obtained from the joint distribution of 4a; , B; and 
R(IL”) — R(1), as was pointed out by Tukey. Furthermore, >0.; yi; — 
RAL”) = do &; — (RAV) — RG) can be shown to be independently 
distributed as ox” on (n — 1) degrees of freedom, n being the number of 
degrees of freedom associated with >i; @; . This shows that (14) 
follows an F-distribution on 1 and (nm — 1) degrees of freedom.” 
Hamaker [1955] has given an example showing that the sum of 
squares due to ¢ is more effective than the sum of squares due to ,¢ in 
accounting for non-additivity. But Tukey’s test has the advantage of 
being computationally simpler and of resulting in a test statistic that 
is known to follow the F-distribution. It should be noted that for this 
test we have to assume a model that allows for no interaction effect 
other than can be accounted for by a term c[a,8,],; . This is better 


2A more complete proof of this is given on p. 132 of H. Scheffé’s book ‘'The Analysis of Variance’’ 
[Wiley, 1959}. Scheffé also gives another motivation for Tukey’s test; this came to the author’s notice 
after the present paper was written. 
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than assuming a model that allows for no interaction at all, but not 
as good as we might wish. If we have more than one observation per 
subclass, we can assume the more reasonable model 


Tee eae Bp eB) esp; 
€:;, independent, N(O, 0”), and (a8),; = cla:8,],; + (e8);; 
and test the null hypothesis that ¢ = 0, i.e. that 
EQ) = w +a: +8; + @B)ii , 
(a8); being that part of the interaction effect that cannot be accounted 
for by a term c[a,8;];; . Then we can use the test statistic (assuming 


that there are n degrees of freedom associated with the within subclass 
sum of squares) 


e R(AII’’) — RD) : 
(within subclass s.s.)/n’ 


this follows the F-distribution on 1 and n degrees of freedom under the 
null hypothesis, and, if condition (s) hold, provides an approximate 
test of whether the interaction sum of squares can be significantly 
reduced by a transformation of the data (using a monotonic function 
whose inverse is a quadratic polynomial). 

The above can be easily generalized. For example, if we have a 
three-way classification, one observation per subclass, (6) becomes 
Ein) = w +a; + B; + ve + cl@i8; + ar + Biv); 

using an obvious notation. The sum of squares due to ,é is then 
[X (4:8; + I + Bess)” 
tt — 47 oa ee 
Coe Pe SSCL are ae 
- provided ay Ch = a 6;,= ee ¥, = 0. If there is more than one 
observation per subclass, the summations are extended over all observa- 
tions in each subclass. 

When there are unequal subclass numbers, it is often inconvenient 
computationally to use the restriction that the class effects should sum 
to zero (this is especially the case when a large body of data is being 
analyzed with the aid of an electronic computer). Analogous to equa- 
tion (11) we can find, for the two-way classification, the restriction- 
free formula 


_ LD WenleBi)is = webs = GLB is — Ble) 
ay [a:Bi lis 
=. (Yi jets B; a na,B; — 5B; as a.83)/ >, [a; Biles ) 
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the summations being over all observations. Substituting this value 
into the equation obtained from (12) on replacing @;8; by [eB]. 
(and allowing for more than one observation per subclass), we obtain 


1 GUS) a R(1) = 19D, (YisnOiBs = peuB; — GiB; > a8) : 

E, [a:B les 
the summations again being over all observations. The denominator 
in this expression is that part of SS (&,8;)° that is due to an interaction 
effect: it is conveniently obtained in a way analogous to that by which 
the interaction sum of squares of the y;:;, is obtained, but with the 
variable &,8; replacing y;;, for all k. 


FINDING AN APPROPRIATE TRANSFORMATION FOR ADDITIVITY 


Consider, as a function of p, , EG.) = har Me ae Cin lice 2c. 
this is a convex function, while, if c < 0, this is a concave function. 
Also, the inverse function given by 


ue = {-1L + V1 — 4el(u — BG,)]}/2c, (15) 


is conversely (where it is real-valued) concave if ¢ > 0 and convex 
ifien<e 0: 
Note that we can write the inverse function (15) 


Lu.) = (EQ) + (1/42) — #)™, 


where L(u,) is a linear function of yu, . This would suggest that, .1f we 
reject the hypothesis that c = 0, a transformation that might be ex- 
pected to bring our data closer to additivity is the transformation 
ce = (y + k) where y is the observed value and k is estimated by 
1/4¢ — a. However, in view of the fact that the inverse of model (II’) 
(exemplified by equation (6) for the 2-way classification) is slightly 
different from (15), and that in any case such a model is unlikely to 
offer the best fii to the data, attention should not be restricted to this 
transformation alone. Tukey [1949a] suggests that we empirically 
seek out the best transformation in the class « = (y + k)*, choosing 
p <li. > 0,sandypi et aloe {PVN Gi OCI ee eee 
¢ <0. This advice regarding the choice of p follows from the fact that 
just as (15) is concave or convex according as c > 0 orc < QO, 
soz = (y + k)? is concave or convex according as p < Loor- pas aile 
It is further suggested here that values of k approximating (4 | 1¢ \)-?? — a 
be tried first, since this is a solution to 


[E(y.) + (1/4¢) — al” = [B@) + KI’, 


when H(y,) is replaced by @. However, it has been found empirically, 


VARIANCE ANALYSIS ADDITIVITY 217 


from some artificially constructed examples, that this method of choos- 
ing k should only be used as a rough guide. 

Now suppose we have two different transformations each of which 
we think may help in obtaining additivity. (For example, we might 
have two choices of p and k for the above general class of transforma- 
tions; or we might have another entirely different transformation sug- 
gested by the nature of the data.) Then we might reasonably try and 
find the best linear combination of these two, i.e. a transformation of 
the form 


2 = dfity) + dofo(y) , 


where f, and f, are the two given functions and d, and d, are unknown 
coefficients. Tukey [1949b] has given a procedure for choosing d,/d>. 
His method can be extended to combine linearly more than two func- 
tions of the data, and the general method of obtaining suitable co- 
efficients d; will be given here. 

Let x = difi(y) + dof.(y) + --- + d,f,(y). We will choose the 
p coefficients d; given by the row vector d’ = (d, , dz, --- , d,) to maxi- 
mize a certain ratio of sums of squares of the z’s. Thus in a simple 
two-way classification Tukey considers maximizing 


row sum of squares of the z’s + column sum of squares of the 2’s, 


residual sum of squares of the 2’s 


The method can be used to maximize any such ratio; and, provided 
we have a within-subclass sum of squares available, a general criterion 
would be to maximize 


within-subclass sum of squares of the 2’s 


all interaction sums of squares of the a’s , 


since this would minimize any interaction variance component. 

Let s,(f:f;) and s2(f;f;) be the numerator and denominator sum of 
squares respectively of the transformed data f,(y) in the ratio it is 
desired to maximize. Thus if we use the general criterion suggested 
above, s,(f;f;) is the within subclass sum of squares computed from the 
transformed data f;(y), and s2(f,f;) comprises all interaction sums of 
squares as computed after using the same transformation. Let s,(fif;) 
and s2(f,;f;) be respectively the corresponding numerator and denomi- 
nator ‘sum of cross-products” of f;(y) and f;(y). For example, if 
si(f:f;) were a row sum of squares, s,(fif;) would be obtained as the 
sum of cross-products of row sums, divided by the usual divisor and 
less a corresponding correction term for the mean. If s,(fif;) is the 
within subclass sum of squares, then s,(f;f;) is obtained exactly analo- 
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gously, except that f:(y) and f;(y) are used to obtain cross-products 
instead of just f;(y) to obtain squares. 

Let S, and S; be p X p matrices whose elements in the 7-th row 
and j-th column are s,(f;f;) and s,(f;f;) respectively. 
Then 


$(7z) = SS disi(fifi) ue ye ae d; djsi(fifi) = dS, d, 
i=1 a=1 7>7 
where s,(xz) is the numerator sum of squares of the ’s and d isa column 
vector, the transpose of d’. Similarly 
$2(xx) = d’s, Gl. 


We thus wish to maximize d’S,d/d’S.d. Differentiating this with 
respect to d and equating to zero we obtain p equations that we may 
write in vector notation: 


Sid, @Sid S 
29,4 2@S,g77t = 


where 0 is a column vector of p zeros; 1.e. 


where 
_ dS.d 
aay me 


Thus the largest value of \ is the value of the largest characteristic 
root of (S;$;"), and using this value of \ we choose d to satisfy (16). 
The individual d; are not uniquely determined by (16), but all ratios 
d,/d; are. If we denote the element in the 7-th row and j-th column of 
(S, — Sz) by s;; , we can easily find, for example when p = 2: 


d;/ds = —812/8i1 } 

and when p = 3: 
d;/dz = (813822 = 812823) / (811823 we $12813) , 
d,/ds = (812833 ad S13823) /(S11823 a $12813) « 


If we can find a d which makes all interaction sums of squares of 
the z’s non-significant, we might assume the H(x) are additive and so 
estimate our class effects on this scale. Unfortunately, however, it is 
not clear what hypotheses can be tested: any F-statistics we may 
obtain are influenced by our choice of d. This procedure is therefore 


VARIANCE ANALYSIS ADDITIVITY 219 


best restricted to the case where two independent sets of similar data 
are available; d can be estimated from one set of data, and using this 
d the other set of data can be transformed and analyzed. Then, if we 
assume normality of the transformed data, tests of significance will be 
valid; though only one set of the data should be used in making them. 

The computational procedure in the case of unequal subclass num- 
bers will not be discussed in any detail here. However it should be 
noted that a general solution has already been given, in another context, 
by Roy [1957]. If we equate Roy’s x’ to the vector of transformed 
variables associated with the r-th observation y, , ie. to [f,i(y,), 
fe(y-), -** , fo(y-)], then S, and S, in this paper can be equated to 
Roy’s $ and S* respectively. 
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A NOTE ON SOME GROWTH PATTERNS IN A 
SIMPLE THEORETICAL ORGANISM 


J. A. NELDER 


National Vegetable Research Station, Wellesbourne, Warwick, England 


INTRODUCTION 


Hinshelwood and his co-workers [1946, 1955] have considered simple 
enzyme systems whose behaviour might mimic the growth of bacterial 
cultures under certain conditions. It is well known that the bacterial 
erowth cycle of cell number on a given volume of medium divides 
itself into three main phases. (Figure I). The first phase (or lag phase) 
represents a ‘settling-in period’ when growth may be slight or irregular; 
in the second phase, growth is logarithmic (that is log number is linearly 
related to time); finally in the third phase the relative growth rate 
falls from its constant value in the second phase to zero, often very 
rapidly, and the number in the colony becomes more or less steady. 
The lag time on transfer to a new culture may be generally defined by 
extending the linear portion of the logarithmic phase to the time axis 
(OT in Figure I). Considerable interest attaches to the form of the 
lag phase following (a) the transfer of cells from an ageing culture 
(i.e., one in the third phase) to a new medium and (b) the addition 


stationary 
phase 


ie) il TIME 


FIGURE I 


Ipeauizep Form or THE ReLatTion BETWEEN Loc. NUMBER oF CELLS AND TIME 
In A BacrerraAL CULTURE 
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of a drug to the new medium. Hinshelwood has considered [1946, 
chap. IV et seq.] forms of lag produced in a very simple theoretical 
organism consisting of two linked enzymes, I and II, both of which 
are autocatalytic; thus enzyme I uses substrate outside the organism 
to produce more of itself and also an intermediate C, while II uses C to 
produce more of itself. Cell division is assumed to be geared to the 
amount of II. One of his equations of growth, however, appears to 
need amendment, and it is fortunate that with this amendment the 
equations have an explicit solution so that it becomes possible to re- 
place his approximate treatment of the lag phase by an exact one in a 
number of different situations. 


THE EQUATIONS OF GROWTH 
We write X, and X, for the total amount of enzymes I and II in 
the system at any time, for the total number of cells, Y for the total 
amount of intermediate, and y = Y/n for the concentration of inter- 
mediate on a per cell basis. The equations for the growth of the enzymes 
are then given by 


aX, 


ap = Xs y (1) 
Axis 
ae = baXey, (2) 


while the growth of the intermediate is given by 
= AN — BAY = CY, (3) 


where AX, is the rate of production by enzyme I, —BX.y the rate of 
consumption by enzyme II, and CY the loss by diffusion and irrelevant 
’ chemical processes, which must be proportional to the concentration 
and the number of cells n. (The notation is the same as that in Dean 
and Hinshelwood [1955] except that we have used y for ¢ and have 
introduced Y = nc in their notation). Equation (3) differs essentially 
from the corresponding one given by Dean and Hinshelwood in having 
dY/dt = d(nc)/dt instead of n dc/dt. The difference arises because 
their equation does not take into account the fall in concentration 
of the intermediate due to the expansion of the system. ‘This fall will 
be given by —(y/n)(dn/dt) in our notation, so that in terms of con- 
centration the equation for y is given by 


ey 


y dn 
dt n n n dt 
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whence 


dy dn _ a ON re a ms 


thus reducing to (3), as it must. 

Hinshelwood postulates that the amount of enzyme II controls cell 
division which thus acts so as to keep X2/n, the concentration of II, 
roughly constant. In what follows, we follow him in ignoring the 
changes in X2/n and the diffusion coefficient C due to the discontinuous 
nature of the division process, and assume that the system can be 
taken as expanding continuously in such a way that X./n is constant, 
also that the area/volume ratio is constant so that C is unrelated to n. 
We therefore put X, = n6 where @ is a constant (and equal to 6 in 
Dean and Hinshelwood’s notation), whereupon equations (1), (2) and 
(3) become the linear set 


a a J u0Y, » = AX ae be Oe 


= Gy ) 


The general solution of (4) may be found by the usual methods 
and gives 


X= Ole ke, 
XG = Qik. Ae**' as Qs aoa Q:k.6e *', (5) 
Y= Qik, Ae + Q.0e 


where Q, , Q2 , and Q; are arbitrary constants anda = B6+C. Ast 
hecomes large the organism tends to a steady state where X, , X2 , and 
Y are all growing logarithmically in a constant ratio given by 


Xe & OG ; ¥ = kyla + ky) > k,6A ser (6) 


When the organism is set going from a state different from the steady 
state given by (6), a lag phase will appear before the organism settles 
down into the logarithmic growth pattern. Two types of lag will be 
considered: (a) those produced by a decay in X, or Y (these possibili- 
ties are envisaged by Hinshelwood as being consequences of ageing 
in a culture) and (b) those produced by a drug which changes the value 
of one of the parameters in (4). 


GROWTH FOLLOWING THE DECAY OF ENZYME AND INTERMEDIATE 
(i) The decay of X, 


Suppose that at t= 0, X, 2 Xe2Y = pkila +> ka) ho0An eS ie. 
that relative to X, and Y, X, has declined to a fraction p of the steady 
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state ratio given by (6). Solving for Q, , Q2 and Q; in (5) we find 


n = X./0 = constant 
/ cit 1 we —at 
x {(o/tse)" ee p(t a 1) —t=2es \ (7) 


Without loss of generality we may define the unit of n so that 
n = 1 att = O and the unit of t so that kj = 1. Then (7) becomes 


n= pe’ +(1—p) +2 —e*, (8) 


As t becomes large 
n~ pe’, Inn~Inp +t. 


Hence the lag time 7 as defined above is obtained by putting 
In n = 0, whence T = —In p. Now a is a positive quantity in (8) 
and (1 — e “*)/a@ is a non-increasing function of a for any positive ¢; 
asa—0, (1 — e **)/a >t while asa — ~, (1 — e&*')/a > 0 80 that 
extreme curves of the family (8) for fixed p are given by 


pe = py eb); (10) 
n= pe + (1 —p). 


I 


n 
and 


(ii) The decay of Y 

If the amount of intermediate Y decays to a fraction p in the steady- 
state ratio (6) while the enzyme amounts remain fixed, then we obtain, 
corresponding to (8), the equation 


m= Ca (lope e*) Ja). 
The lag time is thus zero and the curves lie between 


n=e —(1—p)t, (C=O), and is= 6, (QoS). 


GROWTH FOLLOWING CHANGES IN PARAMETER VALUES 


One possible explanation of drug action is that it changes values 
of the parameters in the growth equations; thus an intermediate may 
be inactivated by the drug and be rendered more or less unavailable to 
the next enzyme in the chain, or active sites on an enzyme may be 
blocked, and so on. In the simple organism considered here growth 
depends on 5 parameters, k, , k, , 0, A, anda = B6 + C. We shall 
not consider variations in @ or k, ; if @ varied, then the drug would 
change the mean cell size and so alter C as well, while if k, is permanently 
changed, the final*steady-state growth rate will be changed. We thus 
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consider effects on kz , A, and a which do not change the mean cell 
size or affect the steady-state growth rate after adaptation. In addition 
the drug will be assumed to be present in excess so that the effect on 
the parameters can be taken as fixed and independent of time. The 
results are summarised below: 


(iii) Fallin A 
The drug affects the production of intermediate by enzyme I. If 
A is reduced to a fraction p of its former value and cells are in their 
logarithmic phase before treatment then, following the same conven- 
tions as before 
n=pe +—p) + (0-90 - 6 *)/] 


so that the effect is identical to that of a corresponding fall in the 
amount of X, (equation 8). 


(iv) Fall in ke 


Here the drug affects the autocatalysis of enzyme II, and 
n=pe +1 —p), 


which is an extreme case, as a > ©, of equation (8). The lag in both 
these cases is given by 7 = —In p. : 


(v) Rise in a 


In this case the drug is assumed to mop up the intermediate as it 
is produced, thus increasing C and hence a. The quantity a can also 
be increased by increasing B, the rate of use of the intermediate by 
enzyme II, but this seems a less likely effect of drug action. The cell 
number is given by 


= | e\oaet (ee vole) ae 
(ie sreas a E + aje (1 — p) ; ee) (11) 


where p > 1, since a is assumed to have increased. The lag is given 
by T = In[{( + pa)/(1 + @)]. The extreme members of (11) are given 
n=e for a=0, (12) 

and 
n= ne (= a) for are ty (13) 


Equation (12) of course gives logarithmic growth with no lag, while 
(13) has been met previously as one of equations (10). 
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GROWTH FOLLOWING DISTURBANCE OF TWO QUANTITIES 
OR PARAMETERS 


In this section we consider only the growth following the disap- 
pearance of intermediate accompanied by changes in X, or one of the 
parameters A, k, , or a. Three of these cases produce the same equa- 
tion for n, namely those having a fall in X, , A, or ky to a fraction p of 
their steady state value. All these give 


n = pe’ + (1.— p) — [p(1 — e *')/al, 


with lag 7 = —In p, and extreme curves 
n=pe+(1—p)—pt for a=0, 
n = pe + (1 — 7) Lore Gr =" c0; 


The effect of a is not large here; the curves having a = 0 have a 
slightly more sudden rise to the logarithmic asymptote. 

For a rise in a to pa accompanied by an initial value of Y = 0 
we get 


n= 


t Qa ae 1 net 


The lag is T = In [(1 + pa)/(1 + a@)] and the extreme curves are 
given by 


m=e —f for a = 0, 
ne + (1 — 7), ~=1/p tor ¢= ©. 


EXAMPLES OF GROWTH CURVES 


Cases (i) and (iii) give the same family of curves. Examples of 
these are shown in Figure II (a), (b), and (c) for the extreme values 
of a (0 and ~), and for a = 1, the curves being given for lags equal 
to 2, 4, 8, and 16 generation times. (On our standard scale with k, = 1, 
the generation time is In 2 = 0.693 units.) Case (iv) is covered by 
Figure II(b), while case (v) has as extremes the simple zero-lag line 
In n = t for a = O and the curves of Figure II(b) fora = ~. For 
a = 1, curves very similar to those of Figure II(b) are obtained. In 
Figure II(d) an example is given for case (ii) with the extreme values 
a = 0, p = 0; for p < 1, these curves are sigmoid and asymptotic to 
the no-lag line In n = ¢ from below, the slope at the origin being p for 
all finite a. 

For growth following complete loss of Y accompanied by a change 
in X, , A, k. or a, the curves are similar to those of Figure II(b) for 
all positive a, and examples are not given. 
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FIGURE II 


Examp.es or Growrn Curves AFTER DISTURBANCE FROM THE STHADY STATE 


(a) Curves of the family n = pet + (1 — p)(1 + 2) 
(b) Curves of the family n = pet + (1 — p) 

(c) Curves of the family n = ne# + (1 — p)(2 — en?) 
(d) The curve n = et —¢t 


Notes: the ordinate is In n throughout. In (a), (b), and (c) p takes the values 2-2, 2-4, 2-8, and 2716 
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DISCUSSION 


Though the ‘‘organism”’ discussed in this note is a very simple one, 
the curves show that it is capable of a variety of different growth 
patterns after disturbance from the steady state. The curves exempli- 
fied in Figure IT(c) are somewhat similar to one of those shown in Figure 
10 of Hinshelwood’s book (Hinshelwood [1946]), and described by him 
as an “irregular growth curve’. However we have not succeeded in 
producing a curve such as that shown in Figure 11 of the same work 
where growth is very slight for about 4 generation times and the loga- 
rithmic phase is then entered very rapidly indeed. When the lag is 
as short as 4 generation times, none of the curves produced by the 
model has a near-horizontal early part. It is only when the lag reaches 
about 10 generation times that we find in some cases a substantial 
period at the beginning when growth is very slight. 

One way of producing a curve of the type of Hinshelwood’s Figure 11 
might be to postulate a drug which combined with the intermediate 
and prevented its use by enzyme II, the drug itself becoming inacti- 
vated by the combination. In this situation the amount of intermediate 
available to enzyme II would be kept very low until all the drug had 
been immobilized after which it would rise to the stable state level. 
The general question of the behaviour of the organism ‘growing away’ 
from a drug after immobilizing it is complex and does not seem amenable 
to exact solution. Similarly the growth equations for Hinshelwood’s 
other theoretical organisms, such as the one having two cyclically 
linked enzymes (Hinshelwood [1946], p. 81 et seq.), or alternative path- 
ways (ibid. p. 150 et seq.) appear to have no explicit general solution, 
though of course numerical solutions for given values of the constants 
could easily be obtained. 

A striking feature of the model is the occurrence of similar or identi- 
_ eal growth equations of n following different changes of the steady 
state. Since these changes have quite distinct biological interpreta- 
tions, it is clear that it will rarely be possible to deduce from the shape 
of a growth curve the sort of mechanism likely to have produced it. 


SUMMARY 


The growth equations for a simple organism proposed by Hinshel- 
wood, and consisting of two linked autocatalytic enzymes, are shown 
to produce a linear system under certain conditions. 

Solution of these equations gives the form of the lag in growth 
following (i) decay of one enzyme and/or the intermediate and (ii) 
changes in the values of the parameters such as might be “brought 
about by drug action. 
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Examples of the types of curves produced are given, and their 
resemblance to actual bacterial growth curves discussed. The produc- 
tion of identical curves by several distinct mechanisms in the model 
is pointed out. 
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1. Introduction 


The diallel cross, which is composed of all possible single crosses 
among a group of inbred lines, is now a common plan of investigation 
in both plant and animal breeding. Its modern use starts apparently 
with the development of the concepts of general and specific combining 
ability by Sprague and Tatum [1942]. The diallel cross is used to 
estimate the genetic components of the variation among the yields of 
the crosses (see, for instance, Hayman [1954a, b]; Jinks and Hayman 
[1953]; Griffing [1956a, b] and Kempthorne [1956, 1957]). It is also 
used to estimate the actual yielding capacities of the crosses. This 
information may be employed, for example, to select the best four 
inbred lines from which to develop a four-way cross. We shall discuss 
only the relatively simple situation in which there are no maternal 
effects, i.e. reciprocal crosses are identical and so need not be made, 
and in which there is no interest in the performance of the inbred lines 
themselves. The important questions concerned with replication over 
years and locations will not be considered. The methods used and 
conclusions reached in this paper may be capable of extension to more 
— complicated situations. 

With no reciprocal crosses or crosses resulting from selfing or crossing 
within the same inbred line, there are "C, = n(n — 1)/2 possible single 
crosses among” inbred lines. This number of possible crosses in- 
creases rapidly with n. When n = 6 it is only 15, when n = 20 it is 
190, and when n = 50 it is 1,225. With facilities available for testing 
only a limited number of crosses, a diallel cross may only be possible 
between a relatively small number of inbred lines. If only a small 
number of inbred lines are tested, the estimate of the variance of the 
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general combining abilities among the whole population of potentially 
available inbred lines will be subject to a large sampling error and 
many potentially high yielding inbred lines left completely untested. 
The question arises, therefore, of whether a design involving more 
inbred lines but only a sample of all possible crosses between them 
may not be preferable. A particular method of sampling the diallel 
cross will be considered and its efficiency in estimating the genetic 
variance components, the differences between the yields of the various 
crosses and the general combining abilities of the lines discussed. 


2. The Partial Diallel Cross 


The best method to be used in deciding how to sample the diallel 
cross would be to specify, in general terms, the probabilities of sampling 
each particular cross and each particular pair of crosses and then choose 
these probabilities so as best to satisfy the aims of the experiment. 
We have not been able to do this. We shall consider only the particular 
method of sampling the crosses suggested by G. W. Brown in about 
1948. The method certainly achieves some balance and whether a 
much better one exists is rather doubtful. It has already been used 
twice at Iowa State University (Jensen, [1959]; Sprague, unpublished.). 

Assume that the breeder can handle a total of ns/2 crosses where 
n is the number of inbred lines and s is a whole number greater than or 
equal to 2. Clearly, n and s cannot both be odd. The n inbred lines 
are numbered at random from | to n and the following crosses sampled :— 


Ime ios Ines -2905 ed = Sikes 

line 2 X‘hnes k + 2,4 + 3, --- ,; E+ 1s 

linet X lineskK +71,k +724+1,---,k+2-—1+s8 
linen X linesk-+n,k+n4+1,---,ktn—1+58, 


where k = (n + 1 — s)/2, and is a whole number, and all the numbers 
above n are to be reduced by multiples of n so as to be between 1 and n. 
For k to be a whole number, either n is odd and s even or n is even 
and sodd. Each line occurs in s crosses and the total number of crosses 
sampled is ns/2. s = n — 1 corresponds to the complete diallel cross. 
The reader may have noticed the analogy between this method of 
sampling the diallel cross and the experimental design for blocks of 
two plots in which the s blocks containing treatment number 7 also 
contain treatments numbered k +71,k +74 1,---,k+7i—14s. 
These circulant designs are discussed by Kempthorne [1953]. To 
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every design for blocks of two plots in which each treatment occurs 
the same number of times, there corresponds a method of sampling 
the diallel cross in which each line is involved in the same number of 
crosses, and conversely. Treatments 7 and j occurring in the same 
block corresponds to sampling the cross i X j. Any balanced in- 
complete block design corresponds to the complete diallel cross. Partial 
diallel crosses corresponding to other two plot per block designs have 
not been considered. Partial diallel crosses corresponding to other 
circulant designs, including some with n and s both even, (Zoellner 
and Kempthorne, [1954]), could be studied by methods very similar 
to those of this paper. 

We shall assume that the yield from the cross 7 X 7 in replicate 1 
can be written 


Yio = BT. oe Ee Gs sip Cat 5 


where uy is a general effect, 7, a replicate effect, g; and g; are the parental 
effects (sometimes called general combining abilities), s;; is the effect 
of the non-additivity of the parental effects (sometimes called specific 
combining ability) and e,;, is the plot error. We shall further assume 
that g; , s;; and e;;, are independently normally distributed with zero 
means and variances of , o, and o, . The motivation for this model 
is as follows. The yield of cross 27 in replicate / is assumed to consist 
of three parts combining additively: an effect of the cross, a replicate 
effect and an error due to plot deviation and also to segregation within 
the cross, if there is any. As regards the cross effect, different progeny 
of the same cross are full sibs, and progeny of two different crosses 
involving a common line are half sibs, and this is the only genetical 
structure. If the lines are a random sample from a large population, 
the cross effect can be represented by 


G,-F G; a ery 


in which g; , g; and s,;; are assumed to be independent random vari- 
ables with variances o7 , «7 and o; respectively, where 


c= COV (ELS) 
and 
o? = Cov (FS.) — 2 Cov (HS.,), 


HS. denoting half-sibs and F.S. denoting full-sibs. 

The diallel cross can be used with multi-flowered plants and, if the 
plants used are a random sample of the population of plants, the above 
covariances are covariances of relatives in that population. If further- 
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more that population has random mating structure, is not inbred and 
there is no linkage, 


Cov (H'S8.) = 104 + teoaa t+ > 


and 


x i 2 2 
Cov (FS.) = ,o4 + 6p 1 14g to 


If the parents are individuals arising by inbreeding to degree Ff in 
such a population, then (Cockerham [1954], Kempthorne [1957] - 


Cov (HS.) = ( = a i (it) Boas 


4 4 
and 
cov rs) = (LF) + (EF Yo, + Clo + 


If F equals unity, i.e. the parents are completely inbred, Cov (H.S.) is 
equal to the covariance of parent and offspring in the original random 
mating population and Cov (F.S.) is the genotypic variance in the 
original population. If the lines are only partially inbred, genetic 
interpretation is possible only if the degree of inbreeding is constant 
over the lines and is known. 

For a general value of F, all we know about o; and o; is that the 
former, ¢, = Cov (HS.), involves only the variances due to additive 
effects and interactions of additive effects and that the latter, of = 
Cov (F.S.) — 2 Cov (H.S.), does not involve the variance due to additive 
effects, o; . If the lines are completely inbred, 


2 2 2 
G, = 304 4 40a oe eons 


and 

2 2 2 
Cx Op + 3044 Seance) 
The total genotypic variance is 2° + o, and, if epistacy can be ignored, 
the additive variance is 2c; . The ratio of additive to total genotypic 
variance would then be 


20;/(20, + 03), 


and the square root of twice the ratio of dominance to additive variance, 
which, if gene frequencies are equal to 4, is interpretable as the average 
degree of dominance, is 


N/ Gone 
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If the lines are not inbred at all, ie. F = 0, 


2 


ees eee 2 

6, S204 TGTAA a Rs 
and 

aera ne Lp) 2 

i 20p + $044 ean oe 
If epistacy can be ignored, the ratio of additive to total genotypic 
variance is 


Tal (oe + 3) 


and the average degree of dominance, (see above) 


The plot error variance o; will contain a component due to genetic 
variability within crosses. In fact it will contain the total genotypic 
variance less the covariance of full-sibs and this is zero only if F equals 
unity. The environmental variability in the plot error e;;, consists 
of plot to plot variability plus environmental variability particular to 
each individual and, if there were competition, would also contain a 
component from this force. 

The above model seems entirely appropriate for the consideration of 
the estimation of «7 and o; , which are known as half the variance of 
general combining ability and the variance of specific combining ability 
respectively. Likewise it seems entirely appropriate for the estimation 
of the covariances of half-sibs and full-sibs, er estimation of related 
quantities like the average degree of dominance. 

It is less appropriate for the consideration of the yields of possible 
multi-way crosses, such as two-way or four-way crosses, because the 
plant breeder will be in the position of having a specified set of lines 
and is interested in the potentialities within that set of lines rather 
than within a random set of lines that he might have obtained. But 
it appears to us that even in this case the plant breeder has no alter- 
native to assuming an additive model in which the yield of cross 7 & 7 
is made up of an effect due to line 2, an effect due to line j, and an effect 
due to the non-additivity of the effects of lines 2 and 7. Earlier an 
attempt was made to consider the situation in the framework that 
there was a finite population of N lines from which n were drawn at 
random and the partial diallel cross made among these n lines. Un- 
fortunately, considerable mathematical difficulties were encountered 
and no neat result obtained. We shall therefore look at the problem 
of estimating all possible single crosses by the use of the simple additive 
model described above. 
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3. Estimation of the Variance Components 


We shall estimate the general combining abilities g: , -*- , Jn by 
least squares, i.e. by choosing @, g: , «+> , J, to minimize 


oy (Ga a ras 9)", 


where >>, denotes summation over the sampled crosses and y;; the 
mean yield of cross7 X j. With the imposed constraint 


do: = 0, 


this leads to the equation @ = 9, the average of the y,,’s, and to the 
following equations for G1 , --* 5 gn , 


= 2G ; 
> 39; =a S Eee aa =| = Q; ) (a ae iN 2, rie ,n); (3.1) 
i=2 r=0 ns 

where G@ is the grand total of cross averages and a;; = 8, all z, and 
Q;; = 4;; = 1 if cross? X 7 is sampled and a,; = a;; = O otherwise. 
The n X n matrix A = [a;;] is a symmetric circulant matrix and there- 
fore so is its inverse, A’’ = [a‘’]. (The general form of a circulant 
matrix is given in the appendix at the end of this paper. A symmetric 
circulant matrix has the additional property that the element in the 
ith row and jth column is the same as the element in the jth row and 
ith column for all 7 and j.) Being a symmetric circulant, the elements 
a‘’ of the matrix A™* are functions only of | 7 — j |, the positive value 
of (¢ — j). We shall therefore write a‘? = a'*''. By multiplying 
together rows of A with columns of A“, 


k+s-1 
> @ == (3.2) 
r=k 
and 
kts 
> Rae oe tl we (3.3) 
r=k 


In (8.3), the ¢ + r index in the summation is to be reduced by multiples 
of n so as always to be between 0 and nm — 1. By summing (3.3) from 
= 1tot = n — 1, and using (3.2), the sum of any row of A’ is 


Ds Ee SVS (8.4) 


The total sum of squares of cross averages in the analysis of variance 
of the partial diallel cross can be decomposed as follows: 
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Dd (Yi; — 9)” a > 9:9; + es, Un = y= 9e= g;) 


where # is the mean of the cross averages. 
Now from (3.1), and (3.4) 


Y 40 = © Y 09.0, 


n s—1 
= 2 om @ Ye shpat Geen = 2G’/ns. (3.5) 


i,7=1 1,m=0 


Fixing a particular sampled cross, i X (¢ + k + 1), and considering all 
sampled crosses, 7 X (j + k + m), related to it, either as full or half 
sibs, and using (3.2), (3.3) and (3.4), 


n c—- 2 
a(S 3 es eee = = a (a° ++ oo) 


7=1 m=0 


42 + Cov (F.8.) — 2 Cov as)} + Cov(HS.). (8.6) 


Summing (3.6) over 7 and J, again using (3.2), (3.3) and (3.4), and 
substituting an expression for E(G’), (3.5) gives 


ws 4.0.) =(n — D4” + Cov (F.S.) — 2 Cov as)} 
+ s(n — 2) Cov (HS.) 


Table 1 shows the analysis of variance of the partial diallel cross, with 
the expected values of the mean squares written in terms of o7 and o; 
instead of in terms of Cov (F.S.) and Cov (H8.). 


TABLE 1 


ANALYSIS OF VARIANCE OF PARTIAL DIALLEL Cross 


Source d.f. Expected values of mean squares 
Replicates r—1 
General Combining 
Ability n—1 ot + roz + {rs(n — 2)/(n — 1)} 9 
Specific Combining 
‘Ability n(s/2 — 1) ve + ro: 


Replicates X Crosses | (r — 1)(ns/2 — 1) | oe 


Total rns/2 — 1 
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With a complete diallel cross there are always many more degrees of 
freedom for the s.c.a. (specific combining ability) mean square than for 
the g.c.a. (general combining ability) mean square. Unless o? is small 
compared to «2 + ro? , this may be a serious disadvantage in the esti- 
mation of o? . The partial diallel cross does allow the (ns/2 — 1) 
degrees of freedom for crosses to be split more evenly between the two 
mean squares. Clearly, for 7; to be estimable, s has to be greater than 2. 
With s = 3, there will be nearly twice as many degrees of freedom for 
the g.c.a. mean square as for the s.c.a. mean square. With s = 4, the 
degrees of freedom will be approximately equal. Table 2 shows the 
breakdown of the degrees of freedom for an experiment involving 2 
replicates of 120 plots each when the crosses are (i) a complete diallel 
cross (s = n — 1 = 15), (ii) a partial diallel cross with s = aw = 80) 
and (iii) a partial diallel cross with s = 4(n = 60). The determination 
of the optimal numbers of replicates r, inbred lines n, and crosses per 
inbred line s/2 will involve the unknown values of o;/o; and o;/¢, as 
well as the cost function of the experiment. Another difficulty in 
deciding on a value for s is that an exact specification of the aims of the 
experiment is necessary. One possible aim of the experiment is to 
estimate components of genetic variance such as the additive and 
dominance portions and the goodness of the design will depend on 
these and on the inbreeding coefficient of the lines. A compromise may 
have to be made between various aims. The proportion of the genetic 
variance that is additive (Fisher, [1918]) and the average degree of 
dominance (Comstock and Robinson, ‘[1948, 1952]) are both, under 
severe assumptions (see Section 2), functions of o;/o, . Again, measures 


TABLE 2 


DEGREES OF FREEDOM FOR COMPLETE AND For PARTIAL DIALLEL 
Cross Ustne 2 ReriicaTes or 120 PLots Eacu 


Source Degrees of freedom 


Complete diallel cross Partial diallel cross 


Replicates il Ht 1 
CGacwar 15 79 59 
DaCuras 104 ; 40 60 
Replicates Crosses 119 119 119 


Total 239 239 239 
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of heritability will be functions of 07 , o? , and o®?. A design that mini- 
mizes the variance of the estimates of o? and o? does not necessarily 
minimize the variance of the estimates of these functions. For example, 
by increasing the degrees of freedom for the g.c.a. mean square at the 
expense of the degrees of freedom for the s.c.a. mean square, the variance 
of the estimate of o% is increased and the covariance between the esti- 
mates of of and o; made more negative. This may result in an increase 
in the variance of the estimate of o2/o> , despite the decreases in the 
variance of the estimate of o? . 


4. Comparison of Partial Diallel Cross with Other Designs for Estimating 
the Variance Components 


A comparison of the partial or complete diallel cross with other 
designs for estimating the variance components is complicated by all 
the factors mentioned at the end of the last section. The experiment 
analogous to what Comstock and Robinson [1952] called Experiment I 
would be to cross each of m randomly chosen inbred lines used as 
“sires”? with m different sets of f randomly chosen inbred lines used as 
“dams”. This type of experiment has been widely used in poultry 
to estimate covariance of full-sibs and half-sibs. No inbred line would 
be used both as a “sire”? and a ‘‘dam’’. The analogy is not complete 
since the experimental material considered by Comstock and Robinson 
was produced from random matings among plants of the /’, generation 
of a cross of two inbred lines, whereas we are considering the progeny 
resulting directly from crosses among a set of inbred lines. The analy- 
sis of variance of Experiment I is shown in Table 8. The variance of 
the estimates of both o; and of with this design can be shown to be 
greater than the corresponding variances for the partial diallel cross 
with n = mands = 2f. The two designs involve the same number of 


TABLE 3 


ANALYSIS OF VARIANCE OF AN EXPERIMENT IN WHICH m INBRED 
LINES ARE Hacu CROSSED TO A DIFFERENT SET OF f INBRED LINES 


Source dite Expected values of mean squares 
Replicates r— 1 ; 5 
_ “Sires” m—1 on + roe + r(f + 1) a9 
“Dams in Sires” m(f — 1) oe + ros + roy 
Replicates Crosses (r — 1)(mf — 1) oe 
Total rmf — 1 
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crosses but the partial diallel cross uses f fewer inbred lines. The 
covariance between the estimates of o, and o” is less negative with 
the diallel cross than with Experiment I. The experiment analogous 
to Experiment II of Comstock and Robinson [1952] would be to make 
all the crosses between m randomly chosen inbred lines used as ‘‘sires”’ 
and f randomly chosen inbred lines used as “dams”. No inbred line 
would be used both as a ‘‘sire” and a “dam”. The analysis of variance 
of this design is shown in Table 4. An estimation procedure that does 
not use both the sire and dam mean squares to estimate 0, can be 
shown to be inferior to the estimation procedure possible with a partial 
diallel cross with n = mands = 2f, ie. a partial diallel cross involving 
the same number of crosses but f fewer inbred lines. Comstock and 
Robinson [1952] have suggested that, when possible, Experiment II 
should be designed with m = f. This is not necessarily optimal, but it 
does permit a simple pooling of the dam and sire mean squares. Un- 
fortunately, it also results in many more degrees of freedom for the 
s.c.a. mean square than for the g.c.a. mean square. There will be 
(m — 1)/2 times as many degrees of freedom for the s.c.a. mean square 
as for the g.c.a. mean square. Unless; is small compared with o2 + rej, 
this may be a serious disadvantage in the estimation of o? .. Indeed 
one of the main general reasons for using the partial diallel cross is that 
it gives a reasonable number of degrees of freedom for the g.c.a. mean 
square. 

A comparison of the partial diallel cross with Experiment III of 
Comstock and Robinson [1952] and other more complicated designs 
will not be attempted. Experiment III may be superior to the partial 
diallel cross. A complete comparison of the two designs would again 
involve the values of unknown parameters and would also raise questions 


TABLE 4 


ANALYSIS OF VARIANCE OF ALL CROSSES BETWEEN 
Two DirFrERENT SETS OF m AND f INBRED LINES 


Source aate Expected values of mean squares 
Replicates r—1 
“Sires” m— 1 oe tro. + roy 
“Dams” f-1 oe + ros + rmo; 
“Sires X Dams” (m — 1)\(f — 1) otro 
Replicates X Crosses (r — 1)(mf — 1) oe 


Total rmf —1 
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regarding the sensitivity of the estimation procedures to departures 
from some of the genetic assumptions. 

All the designs discussed so far have involved crosses between lines 
both of which were drawn at random from the population of inbred 
lines. One result of this is that all of the variation between crosses 
can be used to estimate either of or o. . There are two serious dis- 
advantages involved in the use of common testers that are not a random 
sample from the same population as the inbred lines. Firstly, the 
general combining ability of a line has to be defined as the average 
performance of that line when crossed to all members of a certain 
population. The average performance of an inbred line when crossed 
to a set of common testers will almost certainly not be the same as its 
average performance when crossed to all other members of the popula- 
tion of inbred lines. This could seriously invalidate the estimate of 0? . 
Secondly, there would seem to be little or no reason to assume that the 
variance of the specific combining abilities among crosses between 
inbred lines and common testers is equal to the variance of the specific 
combining abilities among crosses between the inbred lines themselves. 
These considerations strongly suggest that common testers should not 
be used to estimate the values of «7 and o% for the population of inbred 
lines, unless the common testers are themselves a random sample from 
that same population. 


5. Comparing the Yielding Capacities of the Crosses 


We shall make the same assumptions and use the same notation as 
in the previous sections of this paper. We shall consider the partial 
diallel cross as a method of estimating the yielding capacities of all 
the possible single crosses among n inbred lines. Inbred lines are 
often developed and then crossed in an attempt to produce crosses 
with a high yielding capacity. The yielding capacity of all crosses in 
the diallel cross and all sampled crosses in the partial diallel cross can 
be estimated in each of two ways. Firstly, they can be estimated by 
their mean yields in the experiment. Secondly, specific combining 
ability can be ignored and the yielding capacity of the cross 7 X j 
estimated by 2 + g; + 9; .° Clearly, the latter method has to be used 
to estimate the yielding capacity of all the unsampled crosses in the 
partial diallel cross. 

We shall designate the methods of estimation A and B, where A 
and B are defined as follows: 


3The two estimates could possibly be given appropriate weights and combined into a single esti- 
mate. For the partial diallel cross these weights are rather involved and vary from cross to cross. In 
this paper, the two methods of estimation will be considered separately and then compared. 
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A. Unsampled crosses estimated by @ + 9; + g; but sampled crosses 
estimated by cross means Y;; . 
B. Both unsampled and sampled crosses estimated by @ + 9: + 9; . 


A more precise statement will be made later, but one may summarize 
briefly that B will be preferable to A only if ro{/o? is small. 

A third method C of estimating the yielding capacities of the crosses 
is to cross all n inbred lines with each of t common testers. As mentioned 
previously, general combining ability is not a property of the line alone 
but a property of the line relative to the population of lines to which 
it has been crossed. To speak of the general combining ability of a 
line without reference to that population is vague if not meaningless. 
If common testers are to be used to estimate the yielding capacities of 
crosses between the inbred lines then we have to make the assumption, 
and it is a big assumption, that the general combining ability of each 
inbred line relative to the common testors is the same as its general com- 
bining ability relative to all the other inbred lines. We hope that the 
importance of this assumption will not be lost in the discussion that 
follows. Without it there is no basis for comparing the diallel cross 
with common testers. In method C, we shall estimate the yielding 
capacity of cross? X j by A + 9g; + g; where g; and g; are the estimates 
of the general combining abilities of lines 7 and 7 with the common 
testers. Like B, C willbe preferable to A only if ro?/o2 is small. Neither 
of the two designs alternative to the diallel cross considered in Section 4 
can be used to estimate the differences between crosses without con- 
founding some of these differences with other contrasts between the 
general combining abilities. They will therefore not be considered 
further. 

In all three methods, A, B and C, the errors involved in comparing 
any two crosses will be composed of two parts. There will be errors 
arising from an inadequate estimation of specific combining abilities 
and from the specific combining abilities that enter into the estimates 
g; . There will also be errors arising from the plot effects. To measure 
the error in comparing any two crosses, we shall calculate the expected 
value of the square of this error. It will be composed of two parts, 
one involving o; and the other o? . The criterion Q, used to measure 
the efficiency of the various methods, will be the average, over all 
“YC, possible comparisons among the n(n — 1)/2 crosses, of this 
expected square error. The values of Q for methods A, B and C will 
be written Q4 , Qz and Q¢ respectively. 

With all three methods of estimation, comparisons can only be 
made between crosses involving the inbred lines actually tested. This 
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is an obvious statement, but that any design not involving each inbred 
line at least once has @ = © should be realized. As a result, the cri- 
terion @ cannot be used to decide on the optimal division of resources 
between the number of inbred lines on the one hand and the number of 
replicates and crosses per inbred line on the other. The number of 
inbred lines tested n is fixed and we need to estimate the yielding 
capacities of all crosses among them. A decision on the value of n 
would presumably involve a compromise between the accurate assess- 
ment of a relatively small number of crosses and the relatively inaccu- 
rate assessment of a large number of crosses. The more crosses tested 
the more intense can be the selection applied to them but the larger 
will be the errors involved in that selection. Problems of this kind are 
discussed in a series of papers by Finney [1958 a, b; 1960]. The methods 
he used to study mass selection in plant breeding cannot be applied 
directly to the present problem because of the unequal accuracies and 
the correlations involved in the estimation of the different crosses and 
because of the genetic covariance o; between crosses involving a common 
line. Also, the aim may not be simply the selection of the best crosses 
but, for example, the selection of the best four inbred lines from which 
to form a four-way cross. 

Two other limitations of the present approach should be noted. 
Firstly, we are considering only one property of the crosses, namely 
‘yield’. With more than one property, a compromise may have to 
be made and a design chosen that is optimal for one quantity but not 
for another. Secondly, we are considering all crosses among a set of 
inbred lines. Of more interest sometimes would be all crosses between 
two different sets of inbred lines that are chosen to bring together certain 
properties that are present in one set but not in the other. 

The algebra needed to calculate the values of Q for the two methods 
of estimation A and B is very tedious. The comparisons among the 
crosses have to be divided into the three categories: sampled versus 
sampled, unsampled versus unsampled, and sampled versus unsampled. 
Using equations (3.2), (3.3) and (3.4) to sum over these categories, 
we find, 


On Tee ea [{2n(n — 2)sa° + ns* — 2ns — n + 2} 
-{o2 + (62/r)} + {n? — 2ns — n + 2}s0%] 
and 
2 0 PA 2 
oa MeL Vie 1), [{2n(n — 2)sa° — n + 2} fo, + (e2/r)} 


+ {n? — 5n + 2}so%], 
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: < é c = 
where a° is, as before, the diagonal element of the inverse matrix A’. 
The value of Q for C, the common tester method, is 


egy 
aa EE hy 


where ¢ is the number of common testers, 7 the number of replicates, 
and o?,,, the variance of specific combining ability among the crosses 
between the inbred lines and the common testers. The quantity 20% 
occurs in Q¢ because the estimate of each cross comparison 7 X 7 versus 
1 X mis biased by the amount 8;; — Sim - To compare A, B and C for 
the same total number of crosses and experimental plots, we shall 
assume that ¢ = s/2, and that the two values of r are the same. The 
assumption that o2,,. = o, has already been criticized in connection 
with the estimation of o? and o? . However an assumption has to be 
made about the ratio o?,,,/o7 and o%;,,/¢, = 1 seems the most reasonable. 
Other values could be studied by using the general form for Qc given 
above. 

A comparison between the two methods A and B is immediately 
possible. 


[o2cy + (at/r)] + 20%, 


2n(s — 2) 2 2 
Q,—-Q: = pet [(o2/r) — o:], forall n and s. 


As is otherwise obvious, the two methods are identical when s = 2. 
If s > 2, A is preferred to B, ie. Qa < Qe , if and only if Of hae ele 
If r > 1, a decision between A and B could be based on an estimate of 
o?/ro? obtained from the analysis of variance of Table 1. To compare 
method C with methods A and B, the values of a’ are required. The 
matrix A~' can be easily determined when s = 2 and when s = n — 1 
(the complete diallel cross). When s = 2, A‘ is generated as a circulant 
by the first row 


nn n Wane n 1 n 1 
De ey eee ee 
(8 oT) : , Veoe AP oe eae 
Te pees n 
Le 2~1) 
and, when s = n — 1, A” has all diagonal elements equal to 
(2n — 3)/2(n — 1)(m — 2) and all non-diagonal elements equal to 


—1/2(n — 1)(n — 2). Substituting a° = n/4 in Qu and Qz , we have 
that, when s = 2, 


=i ; ‘ f 
(n a S 2) [ee 2) G3) ote (og / rt) ee aa le 


Qs =Q,=Qc+ 


THE PARTIAL DIALLEL CROSS 243 


When n = 3, the complete diallel cross is preferred to common testers 
unless of = 0, when they are equally efficient. When n = 5, the partial 
diallel cross with s = 2 is preferred to using one common tester if and 
only if of/roy < 4. When n > 7, using one common tester is to be 
preferred to the partial diallel cross with s = 2 for all values of 5.) TO 
Substituting a° = (Qn — 3)/2(n — 1)(m — 2) in the expression for 
Qs , we have that for the complete diallel cross (s; = n — 1) 


4 
(n — 2) + 1) 


The complete diallel cross with estimation method B is therefore always 
to be preferred to using (n — 1)/2 common testers. The complete 
dialled cross with estimation method A will be even better if gfro. <1, 

Values of a® for s intermediate to s = 2 ands = n — 1 are not so 
easy to obtain. An explicit expression for a’ is derived in an appendix 
at the end of this paper. Expressions for the other elements of the 
inverse matrix A’ are also derived and may be of use in obtaining 
variances for comparing the yielding capacities of particular crosses. 


The formula for a’ is 
1 i sin | 
ey 


=. jr. (n—8)jn(— 
= 5 sin FF — gin @ = Die | 
n n 


Qe + {nm — 3)(o2/r) + (bn — 5)o.}. 


Values for a” calculated from this formula on an IBM 650 electronic 
computer agreed very well with known values for s = n — 1. The 
agreement for s = 2 and n large was not so good. The values of a° 
for s = 2 in Table 5 were calculated by a® = n/4. All other values 
were calculated electronically using the formula above. They are 
thought to be sufficiently accurate for our purpose but not necessarily 
accurate to the number of decimals shown. 

Table 6 shows the values of Q, , Q, and Qe for these same values of 
mands. The first figure for each method is the coefficient in Q of o?/r 
and the second is the coefficient of o7 . The bordered squares are those 
in which the common tester method is to be preferred to the diallel 
cross method. They are the squares for which s/n is small. In all 
other squares, the best method is A or B according as o7/ro? is less than 
or greater than o?/ro2 = 1. The only exception to this is that when s = 2, 
A and B are equivalent. Also, for each value of n there are, presumably, 
intermediate values of s for which the decision between the diallel 
cross and common testers depends on the unknown value of o2/ro. . 
The examples of this in Table 6 are n = 101, s = 10; n = 50, s = 7; 
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TABLE 5 


VALUES OF @° FOR VARIOUS n AND $ 


n 2 3 4 5 6 7 10 20 50 100 
101 25.25 2.588 0.8332 0.2369 | 0.06515 0.02050 0.01005 
100 6.031 1.365 0.5530 
75 18.75 1.969 0.6523 0.1978 | 0.06027 “0.02034 
a 4.515 1.044 0.4387 
51 12.75 1.391 0.4836 0.1616 | 0.05575 | 0.02020 
50 3.119 0.7514 0.3326 
21 5.25 0.6739 0.2732 0.1162 | 0.05132 
20 1.394 i 0.3868 0.2007 
11 2.75 0.4248 0.2014 0.1056 
10 0.7689 0.2604 0.1589 
g) 2.25 0.3721 0.1904 : 
8 0.6458 0.2387 0.1548 
7 LU T(S) 0.3233 0.1833 
6 0.5139 0.2250 : 
5 1.25 0 2917. 
4 0.4167 
re 0.75 


n = 75,8 = 74; a9 = 0.01361; n = 9, s = 8; a9 = 0.1339. 


n = 10,s = 3;n = 8,s = 3andn = 5, s = 2, when the partial diallel 
cross is preferred to common testers if and only if o?/re; < 0.25, 0.42, 
0.63, 5.93 and 0.33 respectively. 

For given values of n and o?/c; , Table 6 could be used to compare 
the values of Q4 , Q, and Q-¢ for different values of r and s.r will have 
to be greater than 1 if o? is to be estimated and s greater than 2 if o% is 
to be estimated. 


6. Estimation of General Combining Abilities 


The variance of the estimate of the difference between two general 
combining abilities using the partial diallel cross is 


245 


THE PARTIAL DIALLEL CROSS 


‘U JO ON[VA PUES OY} OJ OLB SON]VA SUIUIBUIOI ‘MOI OY} IOJ UMOYS U Jo aNIBA 4SIG ay} IO} ae 9dAy VOB Plog Ur soN[eAy 


00°F 00°2 

00°0 00° Z 

00°0 00°z 

AS (TSI 09's 09°F 19° 49°72 

BE 680 O8'O O6'T cc F 00 + 

00°0 §=6©00 2 00°0 002 KA 7 00° + 

oo'€ OO T | GE 62T| OS € OST 06§ O68 T 00's 00°¢ 

OF, T0970) 6650 LZ O28 LOE EG 8 2L29°T 038°9 00°9 

0050) {0092 0050) O08Ga) AT he 227% E8it “OES 08°9 00°9 

os'z 08 0 68°G 680] 40€ LOT] 2S FET | 09'€ ODOT L0% LOS oz Ss oz € 

ST 9F'0 SHEL GSO PLak  SOsON 22) te 08 ONILSe can Ocal SI'S 66% 60°6 00°8 

00°;0 002 00°0 004 | 140 891/880 69'T | 98'T ost 68'S 6S G 60 6 00°8 

g=s 

LONS LOLOW 76 Ge FOO) Le Ea TS: CeCe Pale LO Cee ee OnE Sly Sl 2 ES cere 

OT TAEVON S21 SSS ONZOnt TALON G0) GLE 10n SA ZacSak c6E PLS Sack 0 OT 

010 OOKZs OOF ON 69 FP OIST cS ah |i 6S Te Lenc em cont 69°€ 96% Serr 0 01 

OFaC OS. OF SLee ELE 0) eCOncecO mba stca twee Cale Gr scumG yaa Pas cece L Der SLs t9'S vO'€ 

IST) OL OVPSOcC Sh ON Sera o2) ON SOTe cOr tr S0! eS Grol Lbs ban GG z 88°9 82a°¢ Cn 4 0°02 

0050) 00° "SZ re TE) Gash SG Tell Perc ey P| cleo) Zao & Shas 449 8&8°¢ Oz 0°02 

SE) ST O)62. 2 88.0 Lee AL 0) 01 SO Te 8e S aSe Tere oe pGune 26 Cee cone 99°F 99°¢ sss s3'¢ 

26,1 820°0 ||90°¢ 220) | Lhe (€9'0' | FL-S OS P| SL2€° 6891 | SLi Ferm | oz72) SEs Ee eo G 81g 00s 

00°O0 00°2 | v€'T 760) STZ $60] 86% OF 1} AS SO'Z/] 99% 90'S | Iz4 Es's OFL CL 81g 00s 

Ofc S010 90.2 OST") | GExc 69°0)) Shee" 84°05) TES Tht | 0e es osst | 9G" 99° TL | S6,€ “S671 09 F 097% 68'S 68 ¢ 
S6T €S0°0 | 46 IT 080°0 | €1' 2 #20/ 49°% 840] 29'S E21] OF F LSZ] 10:9 Il'b]99°6 LLL ZeOL) SeAL 6 SL a 74 
00°0 00°2 | 89°0 8E TT} 9'T 24°0 | OFZ 660] 8F'S 98'T | 9E% 69°72] 26S 02 F| 6S'6 ZBL 46% St 6 SL 7A 

PL= 8 
80°2. 84050) 91 2 LST 05) 68.c) 60 S472 820) or 8) Clon) Tee) esr |uere orp loo e. oouT 19% 19°S 76'S 26'€ 
96°T 0¥0'0 | 00'% 180°0 | 812 92'0 | 98°% %6°0] Il'% GIG | 2S O¢'€ | Ze L OFS | AL'ZI Sz‘OT 89S 6&2 6 101 0°O0T 
00°0 002 /} 70T 0'1L | 28E 29°0} OL OL IT | 10% 62°% | PI'S SE's | 9°42 OFS | EI'ZE Bz OT 89S 6 8% 6 10 0 'O0T 
OOT og 06 Or 9 ] v 


SANV U dO SHATIVA JO ADNVY V UOd %) ANV ZH ‘VY NI 72 UNV 4/79 40 ,SLNAIOIMAGOO 


9 HIAVL 


10) 
Ce: 
Vv 
(Oy ty 
el 
Vv 
ey 
aq & 
v 
0 8 
ad 6 
¥ 
OMOL 
ae iis 
Vi 
OQ 06 
ts 14 
Vv 
Os 
es 
V 
O 2 
aq sd 
Vv 
Oo OOT 
See LOE 
Vi 

u 


246 BIOMETRICS, JUNE 1961 


VG: =) = 2@ So ie 
Expressions for a’ and a''~"' derived in the appendix at the end of this 
paper could be used to evaluate this variance for each pair of values 
of i and 7. We shall consider only the average of this variance over the 
n(n — 1)/2 possible comparisons of general combining abilities. Using 


(3.4), this average variance is, 
0 


l 1 2 2 
Av. VQ: = gi) 7; of 1 we Qs(n a = le AG (o2/7)). 


With s/2 common testers, the variance of all comparisons will be 
Vigo 30 = A/sloscy + (a:/r)]. 

Again making the big assumptions that the general combining ability 
of each line with the common testers is the same quantity as the general 
combining ability of each line with all other inbred lines and that 
ov, = 0, , the average efficiency of the partial diallel cross relative 
to common testers in estimating general combining abilities is H = 
4(n — 1)/(2nsa° — 1). We have taken the total number of crosses 
as well as the total number of experimental plots the same for both 
designs. Values of H can be calculated from Table 5 and are shown 
in Table 7. When s = 2, E = 4/(n + 1) and when s = n — 1 (the 
complete diallel cross), H = 2(n — 2)/(n — 1). The partial diallel 
cross is very inefficient compared with common testers when s = 2 for 
all values of n above 3. The complete diallel cross is always at least 
as efficient as common testers and tends to be twice as efficient as 7 be- 
comes large. The partial diallel cross with intermediate values of s 
is often at least as efficient as common testers. If having slightly un- 
equal standard errors for the different comparisons is not a serious 
disadvantage, the partial or complete diallel cross is a better design 
for estimating general combining abilities than common testers provid- 
ing that a sufficient number of crosses can be made so that s/n is a 
reasonably large fraction. Providing s > 2, the standard errors for 
comparisons can be estimated by using the analysis of variance shown 
in Table 1 for the partial (and complete) diallel cross and the usual 
analysis of variance for common testers. 

For a fixed total number of crosses, the partial diallel cross does 
permit more lines to be included in the experiment than does the com- 
plete diallel cross. In selecting lines for general combining ability, 
the resulting increase in the selection intensity will often more than 
compensate for the decreased accuracy with which each line is assessed. 


7. Summary 


A diallel cross among n inbred lines with no parental or reciprocal 
crosses involves a total of n(n — 1)/2 crosses. Clearly, this number 
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increases rapidly with n. With limited facilities, this may mean that 
a complete diallel cross can only be made among a rather small number 
of inbred lines. The design presented in this paper allows a large 
number of inbred lines to be studied by performing only a sample of all 
the possible crosses among them. The efficiency of the design for the 
estimation of the variances of the general and specific combining abilities 
of the lines, for the prediction of the yielding capacities of the various 
crosses and for the estimation of the general combining ability of each 
of the lines, is discussed. The design is shown often to be more efficient 
than other designs that have been proposed. 
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APPENDIX 


INVERSION oF Matrix A 


A is a real symmetric circulant matrix and so can be inverted ex- 
plicitly without much difficulty. Consider the general circulant matrix 


ig ws a 
Gn-1 An-2 
ia oe 
ee ce 
Let w; = exp. {j(2ri/n)}, (j = 1,2, --- , ”), be the n-th roots of unity. 
Then 
1 1 
W; W; 


ry Doe (ay + aww; + +++ + a,awy")]) wv; 


i 
° 


n—1 n-1 


W; W; 
Therefore, the characteristic roots of A are 
A; = A + au; +--: + a,j," je recent ties (1) 


We can invert these equations to obtain the a’s in terms of the )’s, 


pA 


Mo = TA ba ee Nal 
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and 
1 7 J : 

Qa; = n [Aiwa = NoWp-2 —- ha + Nels i] a Ii, Dy eel > n a Ie 

Now, A is also a circulant matrix and has characteristic roots 
1/\;(7 = 1, 2, --- , n). Therefore its elements can be written 

1 1 
a = ie +S uid. oe | 
and 
rsa (5! 1 
fait ttt 2], Geely ee Le (2) 
di n 


Since the matrix A is real and symmetric, its characteristic roots }, 
and the elements of the inverse matrix are both real. Substituting 
Qo = S, Ay = Ay = -°* = Oyen = Ones = Aeorr = *°* = Ani = Oand 
@ = Ga =~ = Gs.-1 — Pin’ CG) and taking real parts; 


sin (n — s) = 


N= Si 3 a, He vily, 
or 
Sine 
n 
and (3) 
ey = WS. 
Taking real parts on both sides of equations (2), 
eee! E on tee 1] 
: nm LrAy + Ao a i Agshe 
and 
ema hies Ut liom aU) ane 
CE es of 4 2 7 = 1,2, Reed (4) 


The elements of the inverse matrix can therefore be obtained by 
substituting the equations (3) for the \’s into the equations (4). This 
method was used to compute the values of a’ in Table 5. For small 
values of n, the methods available for inverting general matrices may 
be preferable to the special method outlined above for inverting 
circulant matrices. 
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SENSORY TESTING BY TRIPLE COMPARISONS 


G. T. Park 
T. Hedley and Co. Ltd., Gosforth, Newcastle upon Tyne, England. 


1. INTRODUCTION 


In sensory test work, the number of samples a judge can assess at 
any one time is often limited by sensory fatigue and confusion of stimuli. 
This necessitates the use of incomplete block designs. Further, the 
difficulty of obtaining satisfactory quantitative measures of treatment 
effects usually entails assessment by ranking. Thus there is need for 
a rank analysis of incomplete block designs of a versatility equivalent 
to that available for variables satisfying the conditions of analysis of 
variance. In addition, any such analysis should provide the experi- 
menter with a check as to whether he can regard the treatments con- 
cerned as lying on a linear scale of acceptability. For, with sensory 
assessment, cases can and do arise where the relations between calculated 
average scale positions disagree with individual direct comparisons. 

These needs have been met in the case of blocks of size two, by the 
various paired comparison procedures developed in recent years [2, 4-6; 
13, 14, 18, 17]. Although this is undoubtedly the most important case, 
instances arise where blocks of size three or more are a practical propo- 
sition and hold out the possibility of increased efficiency. 

Unfortunately, little work seems to have been done in the general 
case of blocks of size k. The result published by Durbin [7] allows an 
overall significance test of treatment differences on balanced incomplete 
block ranking experiments, whilst the rankit method of Fisher [10] has 
been used by some workers, but no generally accepted method (of 
adequate versatility) has been available for blocks of size other than 
two. The method recently published by Pendergrass and Bradley [3, 15] 
which offers a comprehensive treatment of the case k = 3 is therefore 
of immediate interest. 

This paper is to assist evaluation of the Bradley-Pendergrass method 
by contrasting results from triple comparison testing with results ob- 
tained using paired comparisons. Two sets of data are discussed. One 
set is from margarine cocktail stick taste tests, where sensory fatigue 
is known to be marked. The other set is from toilet soap sniff tests, 
where sensory fatigue is much less of a problem. 
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2, BRADLEY-PENDERGRASS TRIPLE COMPARISON ANALYSIS 


The members of each of the (“) triplets formed by the set of ¢ 


treatments are ranked in order of acceptability by judges.’ Analysis 
proceeds as follows. 


The Model 


The model used is an extension of that successfully developed by 
Bradley and Terry for paired comparisons, (2, 4-6, 8, 11]. It is supposed 
that the treatments 7,,--:, 7: are associated with parameters 
mw, , +++, 1 Which satisfy the following conditions: 


t 
Bi = 0, SS tits, == Ls 
i=1 
POS Te) = ea aoe 
Ne im; id) ai m5 (tr; Sr Ura T(r; 2 5;); 

(AGA; 4 fick = nae 
where P(T; > 7; > T;) represents the probability that in the block 
containing treatments 7; , 7; , T;, , treatment 2 is rated top, treatment 
j central and treatment k bottom, in acceptability. 

Maximum Likelihood Estimators of Parameters 1; 


The maximum likelihood estimators p, , --- , p, of the parameters 
™,,°** , 7 are given by the equations below: 


a;/p; =n 2d {[2p.(p; + De) + D; “1s Del/ Dix}, 


1 ka 
Dp: = 1, lca Wy ant 


where a; = twice the total number of first places + the total number 
of second places given to 7; and 


Dix = pip; + pi) + Dilp: + pe) + pilp: + i). 
In most cases these can be solved by one or two iterations using as 
initial estimates the values of p; found from the quadratic equations: 


p[a(2? — 6¢ + 6) — n(2t* — 114° 4+ 222 — 19¢ + 6)] 
-+ p;[a,(2t = 6) —nt = Af’ +- i 2)] + 2a; = 0, t= ie Br cls. a 


1Extension of the analysis to the general case of mé,4 rankings on the triplet Ti, T;, Tx, nize > 0, 
i<j <hk,i,j,k =1°°*° t, is readily made and results quoted by the authors in {15}. 
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Variance—Covariance Matrix of Estimators p, 


Expressions for the variances and covariances are presented by 
Bradley and Pendergrass for large n. These are particularly awkward 
to compute, however, and the authors provide an approximation which 
ae suffice for practical purposes. If 7, --- m, are near equality, 
then 


Var (V ep) =8 — D/G — 9), 4 Sj. oH 
Cov Aap: Wipe 3/0 = 2) ee, a = 1, ee 
Goodness of Fit 


The model can be tested by comparing observed frequencies f;;, 
with expected frequencies f’;, . If f;;, = number of times treatments 
T; , T; , T,, were ranked in that order of acceptability, and 


fein = 7pp;) Dix ; t1#I# Kk, 2; 4; k= Ne orf Ae 
then 
(fiir tin) 
iAxi#k ine 
ay Tipe: 


is distributed as chi-square with Eq — ( — 1) | degrees of freedom 


for large n. 


Scaling of Treatments 


If the data fit the model, then the authors suggest that the treat- 
ments be regarded as lying at points In p, on a linear scale of accept- 
ability. This is by analogy with the paired comparison model of Bradley 
and Terry. In that case, the parameters 7; can be shown to be related 
to a metric on which the mean response for treatment 7’; is In 7; . 


Overall Tests of Main Effects and Interaction 


An overall test for main effects is available if required. [or large n, 


Z= an( 5) In6+2 D4; Inp; — 2n >> In Din 


5 t<i<k 


is distributed as chi-square with (¢ — 1) degrees of freedom. Further, 
if the experiment has been repeated over a number of different groups 
of judges or in different circumstances, interaction can be tested using 
the result that: Z° — Z is distributed as x*(g — 1)(t — 1) where 
Z = >2., Z.,g = no. of groups, Z, = Z above, computed for uth 
group, ¥™=1--: g. 
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Relative Efficiency of Paired Comparisons and Triple Comparisons 


Under the assumption that differences observed by judges between 
the three component pairs of each triplet are the same as those which 
would be observed in the equivalent paired comparisons, then the 
efficiency of triple comparisons to paired comparisons is 9/4 in terms 
of incomplete block rankings (or 3 in terms of treatment replications, as 
three assessments are made per triplet ranking against two per pair 
ranking). 


3. PAIRED COMPARISON ANALYSES 


As the various paired comparison procedures give closely similar 
results [1, 12], any one of them could be used as the basis for assessment 
of the triple comparison technique. However, that of Bradley and 
Terry is an obvious choice because of its relationship to the triple 
comparison method. In addition, the results from Scheffé's method [17] 
are given, as results from this method had to be obtained for routine 
reporting of the paired comparison legs. It must be expected, of course, 
that the results of these two analyses will agree more closely with each 
other than with the triple comparison results, as they are derived from 
the same basic data, whereas the triple comparison data is distinct 
and independent. 


TABLE I 


Resutts or Test I 


Paired Comparisons Triple Comparisons 
Statistic 

Scheffé B-Terry B-Pendergrass 
Response Ratings? or In p; In p; 
Product A —0.55 —0.42 —0.06 
Product B 0.22 0.28 —0.34 
Product C 0.36 0.438 0.40 
Product D 0.45 0.28 0.50 
Product E —0.48 —0.57 —0.50 
S.E. of Diff. +0.48 +0.52 +0.33 
Goodness of Fit X(6)— non x (6) son x-(46)) = 47.5 
Treatment Diffs. SA) = BI SED) Ss 7 x2(4) = 138.8 
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4. MARGARINE COCKTAIL STICK TASTE TESTS 


It was found that guidance on the likely results of full-scale con- 
sumer research studies could be obtained by taste testing small cylinders 
of margarine from cocktail sticks. Two such tests were done for routine 
purposes by paired comparisons and for experimental purposes by 
triple comparisons. 

Test I. In Test I, judges examined five margarines A-E. Each of 60 
judges made one paired comparison to give 3 rankings per ordered 
pair. Each of a further 120 judges made one triple comparison to 
give 2 rankings per ordered triplet. 

Test II. In Test II, judges examined three margarines L-N. 
Each of 138 judges made one paired comparison to give 23 rankings per 
ordered pair. Each of a further 126 judges made one triple comparison 
to give 21 rankings per ordered triplet. 

Paired comparison and triple comparison ratings are seen to be in 
close agreement on Test II. On Test I agreement is not so good, there 
being an indication that the methods give different measures of the 
relative acceptability of Products A and B. The goodness of fit chi- 
squares show that both of the paired comparison models and the triple 
comparison model gave acceptable fits to their respective data. 

On Test II, the relative sensitivity of the three procedures on a 
per ranking basis is indicated by the treatment chi-squares as they 
stand, for the number of pair rankings approximated the number of 


TABLE 2 


Resutts or Test II 


Paired Comparisons Triple Comparisons 
Statistic ee : 
Scheffé B-Terry B-Pendergrass 
Response Ratings? ai In pi In pi 
Product L —0.45 —0.38 —().44 
Product M 0.55 0.62 0.56 
Product N —0.10 —0.24 —0.12 
S.E. Of Diff. +0.46 +0.58 +0.41 
Goodness of Fit xt =10,0- 4x71) = 0.2 x23) = 1.3 
Treatment Diffs. x72) = 4.8 x22) = 3.2 x2(2) = 6.0 


2All sets of response ratings have been adjusted to total zero and range one. 
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triplet rankings. In the case of Test I, it is necessary to double the 
paired comparison chi-squares before comparing them with the triple 
comparison chi-square, as the number of pair rankings was only one 
half the number of triplet rankings. In each case, there is no indication 
that triple comparisons were more efficient than paired comparisons. 
It is of interest that application of Durbin’s chi-square test of treat- 
ment differences to the triple comparison data from Tests I and II 
gave values close to those found under the Bradley-Pendergrass analysis 
(4) = 13.4,x@) = 6.0). 


5. TOILET SOAP SNIFF TESTS 


Two toilet soap sniff tests were done for routine purposes by paired 
comparisons and for experimental purposes by triple comparisons. 
Test III examined four toilet soap perfumes A-D. Each of 132 
judges made one paired comparison to give 11 rankings per ordered pair. 

Each of a further 131 judges made one triple comparison. Although 
the design arranged for rankings to be divided evenly over the 24 
ordered triplets, an error in arranging record sheets resulted in the 
number of rankings varying as follows: (Order of presentation within 


Triplet 
ABC ABD ACD BCD 
No. of rankings 51 ig il 52 


triplet remained in approximate balance). Analysis of this test was 
therefore carried out using the extension of the method to the general 
case when the number of rankings differs between triplets. 

Test IV examined six toilet soap perfumes L-Q. Each of 180 
judges made two successive paired comparisons (involving four differ- 
ent products). As there was no evidence that results from pairs sniffed 
first differed from those for pairs sniffed second, the results for all pairs 
were combined to give a total of 360 rankings, 1. 12 rankings per 
ordered pair. Each of a further 120 judges made one triple comparison 
to give one ranking per ordered triplet. 

On both tests, paired comparison and triple comparison ratings 
are in agreement within the limits of error. 

On Test III, the paired comparison models fitted the paired com- 
parison data, but the triple comparison model was not a satisfactory 
fit to the triple comparison data (goodness of fit chi-square significant 
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TABLE 3 


ReEsvutts or Test III 


Paired Comparisons Triple Comparisons 
Statistic =n 
Schefté B-Terry B-Pendergrass 
Response Ratings Oi In pi In pi 
Product A 0.30 0.23 0.30 
Product B 0.44 0.46 0.55 
Product C —0.56 —0.54 —0.40 
Product D (0518 —0.15 —0.45 
S.E. of Diff.* +0.42 +0.50 +0.31 
Goodness of Fit 7) =O setG)) = OL x17) = 4001 
Treatment Difis. x73) =7.2 x7?(3) = 4.5 x7(3) = 16.5 


3An average value is given for the S.E. of difference of triple comparison ratings as this figure 
varies between pairs of products due to differing numbers of rankings per triplet. 


TABLE 4 


Resutts or Trest lV 


: Paired Comparisons Triple Comparisons 
Statistic 

Scheffé B-Terry B-Pendergrass 
Response Ratings ai In pi In p: 
Product L 0.30 0.37 —0.04 
Product M —0.55 —().45 —0.36 
Product N 0.45 0.30 0.16 
Product O —0.55 —0.68 —0.55 
Product P (0), Abs} 0.19 0.34 
Product Q 0.22 0.22 * 0.45 
S.E. of Diff. +0.38 +0.29 +0.21 
Goodness of Fit x10) = 5.7 x10) = 6.5 x2(95) = 76.7 
Treatment Diffs. x75) = 18.5 x7(5) = 18.5 x2(5) = 32.8 
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at 1 percent level). On Test IV, all three models fitted their respective 
data. 

Using the treatment difference chi-squares as before to compare 
sensitivity on a per ranking basis, it appears that, on both tests, con- 
siderably greater sensitivity was achieved with triple comparisons than 
with paired comparisons. (Note that on Test IV the number of triplet 
rankings was only one-third the number of pair rankings so that the 
triple comparison chi-square requires multiplication by three before 
contrasting with the paired comparison chi-squares). Application of 
Durbin’s test of treatment differences to the triple comparison data 
from the balanced test, Test IV, gave x'(5) = 31.5 in good agreement 
with that found under the Bradley-Pendergrass analysis. 


6. ANALYSIS OF VARIANCE-TEST III 


As the extension of the triple comparison analysis to the case of 
varying numbers per triplet was not available at the time, the triple 
comparison results from Test III were analysed by least squares [using 
dummy variables [16]]. As shown below, the results agreed very closely 
with those subsequently found by the triple comparison method. 


Products S.E. of Treatment 
A B C D Diff. x?(3) 
Triple Comparison 0.30 0.55>—0.40 —0.45 +0.31 16.5 
Least Squares 0.32 0.54 —0.40 —0.46 +0 .31 Avijenl 


7. DISCUSSION AND SUMMARY 


These results suggest that the model on which the triple comparison 
analysis is based will prove appropriate to at least a proportion of 
sensory tests. TI urther, that in those cases where sensory fatigue is 
not marked, it may afford a valuable gain in efficiency over paired 
comparisons. It is pointed out, however, that each of the experiments 
described above compared products which were basically similar and 
which differed essentially in only one dimension. 

In the author’s experience, paired comparison tests involving treat- 
ment differences of this nature rarely give results inconsistent with the 
various paired comparison models. Lack of fit tends to occur when 
treatments differ importantly in more than one dimension or when 
marked judge X treatment interactions are likely to be present. As 
the triple comparison model is likely to be more vulnerable than the 
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paired comparison models to the complications arising on these latter 
types of test, it may prove of limited applicability where treatments 
differ in kind rather than in degree. 
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RAPID CHI-SQUARE TEST OF SIGNIFICANCE 
FOR THREE-PART RATIOS’ 


CHar.es E,. Gates AND Bensamin H. Brarp? 
University of Minnesota, St. Paul, Minnesota, U.S.A. 


INTRODUCTION 


In the study of qualitative characters, the number of individuals 
distributed among mutually exclusive categories is commonly com- 
pared with a distribution predicated on genetic theory. If the observed 
numbers in the various classes deviate from the theoretical, it is de- 
sirable to be able to reject the hypothetical distribution at a determin- 
able probability level. A statistic normally used in ascertaining this 
goodness-of-fit is chi-square. Frequently, the number of such com- 
' parisons may be large and the computations necessary for each segre- 
gating family become laborious. In these instances an accurate, fast 
technique would prove useful. This paper presents a rapid, graphical 
procedure for testing goodness-of-fit to ratios with three classes. These 
graphical chi-square tests of significance are particularly appropriate 
for testing lack of fit to three-part ratios for those organisms where it 
is feasible to take random samples of some constant number of indi- 
viduals from segregating progenies. However, by constructing a series 
of graphs, it is practical to use this method with variable sample size 
if the number of uses is sufficiently large. Graphs to be used with 
the 1:2:1 ratio have been drawn for a variety of values of n and have 
been used successfully in the field for determining lack of fit. 

Although the justification for, as well as the illustrated examples, 
have been drawn from genetics, uses in other fields can be visualized. 
One particular application outside the field of genetics could be in 
evaluating the results of a sample survey where a response can be 
categorized into three mutually exclusive classes and one is interested 
in ascertaining the goodness-of-fit to a 1:1: 1 ratio. 


ILLUSTRATION 
Suppose the numbers of individuals falling in three mutually ex- 
clusive classes are x, , Y, and x; , where 


Me l= an ea = 1s (1) 


1Paper No. 4346, Scientific Journal Series, Minnesota Agricultural Experiment Station. 
2Crops Research Division, Agricultural Research Service, U. S. Department of Agriculture. 
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The critical region for a particular level of significance can be defined 
by an ellipse drawn on rectangular coordinate paper with 2, and 2, 
(or x;) the ordinate and the abscissa, respectively. Figure 1 was pre- 
pared for the 1:2:1 ratio, the three ellipses indicating demarcation 
between the .10, .05 and .01 levels of significance. 

To illustrate the use of this graph, suppose the segregates from 
individual F, plants from a cross of two barley varieties differing in 
number of rows of kernels on the spike are classified and counted. In 
a cross of this kind (two-row VV X six-row vv) the heterozygote (Vv) 
is phenotypically distinguishable from either homozygote and, if con- 
ditioned by one gene, the F, should segregate in a 1: 2:1 ratio. Con- 
sider the following examples wherein x, , x and «x; refer respectively 
to the numbers in the VV, Vv and wv classes: 


Example Ly L2 X3 n 
1 20 16 14 50 
2 16 22 12 50 
3 7 33 10 50 
4 15 33 2 50 


Locating the points of intersection of x, and x, (or x; and x,), as measured 
along the abscissa and ordinate, respectively, the probabilities from the 
x’ statistic can be immediately ascertained as .05 > P > .01, P > .10, 
10 > P > .05 and P < .01 for examples 1, 2, 3 and 4, respectively. 


THEORETICAL DEVELOPMENT FOR AN ARBITRARY 
THREE-PART RATIO 


Genetic theory or other considerations predicate that three classes 
corresponding to those indicated by (1) should have the proportions 
7, , 72 and 73 , respectively. It can readily be seen that 


E(x;) = nn; , Var (x;) = nx, — m,), Cov (a; ,2;) = —naj,r; . (2) 
It is well known (see, e.g., Cochran [1952 ]) that 


— — ai — 7] 
[ary os nm] hea 7) NT TW. | e es (3) 
NT Te nm(1 —. T2) 2 — NT2 
will be distributed asymptotically as chi-square with 2 degrees of 
freedom. Expression (3) may be re-written as 


(2, — nm,)°m(1 — m2) + 2mym(a, — Na ,)(%_ — M2) 


+ (a2 — nm)’, — 1)]/(neyroms). (4) 
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Now (4) is algebraically identical to the chi-square goodness-of-fit 
statistic 


3 


Xe = [((v; — nx,)"]/nr; = b vin, | —n (5) 


t=1 
indicating, incidentally, why the latter has its well known distribution. 


Equation (5) can therefore be seen to represent an ellipse, not in 
standard form, in the x, , x» plane, with the center at 


(nt, , Nt). (6) 


The acceptance region of the usual goodness-of-fit test is the region 
inside this ellipse. 

In order to plot ellipse (5), it is necessary to determine the slopes 
of the minor and major axis as well as their lengths. If the x, , x. axes 
are rotated so that ellipse (5) will be in standard form, the slope of 
the rotated x, axis is 


m=R+VR +1 (7) 
where 
R = (a — W2)13/20yr2 . 
Rotating the ellipse, (5) can be written 


[A(x = nm)’ /x2,a] oy [C(a2 ~— m2) /X2,a1 ll (8) 


where 
A = [mo(1 — m2.) + 2mmyr. + 2(1 — 1) m?)/nayre3(1 + m’), (9) 
C = [wo(1 — m2)m? — 2mm. + m,(1 — m)]/nmyret3(1 + m’). (10) 
The semi-diameters of the minor and major axes are therefore 
Mae A Wie YC, (11) 


respectively, where A, C > 0. 

The slope of the rotated axis, as indicated by (7), is independent 
of sample size, n, as well as significance level. On the other hand, the 
location of the center of an ellipse, as shown by (6), is a function of n, 
but is independent of significance level. The lengths of the major 
and minor axes, given by (11), are a function of both sample size and 
significance level. Hence a family of ellipses with a common center 
and slope can be constructed for given n by varying significance level. 
In using these ellipses, it is recommended that the expected number 
in any class be the customary minimum of 5 or 10° because the distri- 
bution of (3) is only asymptotically distributed as Ne 
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CONSTRUCTION OF ELLIPSES FOR THE 1:2:1 RATIO 


Substitution of 1/4, 1/2 and 1/4 for ™ , 7, and =; , respectively, 
yields the following basic quantities: 


R = —1/4, m = 0.780776, 1+ m’ = 1.609612, 
A = 11.123103/n,  C = 2.876897/n. 


Making use of (6), the center of the 1 : 2:1 ellipse, is located at 
(n/4, n/2). (12) 


Substituting the basic quantities in (11), one-half the lengths of the 
minor and major axes are found to be 


2998-V/ nx... and .5896Vnx2,0 ; (13) 


respectively. An ellipse can be readily constructed for each m in the 
range of interest using a Dietzgen ellipsograph or various simple, but 
accurate construction techniques such as the concentric-circle method 
for major and minor diameters. French and Vierck [1958], a recent 
reference, demonstrate various methods of ellipse construction. 

To illustrate the procedure more specifically, consider ellipses for 
nm = 50, where the critical regions are defined by the 0.10, 0.05 and 
0.01 levels of significance. Making use of (12) and (13), one obtains 
These ellipses were plotted in Figure 1. 


Significance Slope of Location of center Semi-diameter length 
level rotated 
21 axis Xi Xe L1 L2 
10 781 12.50 25.00 4.55 8.94 
.05 .781 127250) 25.00 3220 10.22 
OL meu 12.50 25.00 6.48 1265 


CONSTRUCTION OF ELLIPSES FOR OTHER THREE-PART RATIOS 


Calculation of ellipses for defining critical regions for other three- 
part ratios can be carried out in analogous fashion to the ellipses for 
the 1:2:1 ratio. To eliminate the necessity of deriving intermediate 
calculations, the slopes, center, and lengths of major and minor diam- 
eters for some of the more common three-part genetic ratios and the 
1:1:1 ratio are given in Table 1. These quantities are expressed in 
terms of arbitrary n and significance level. To obtain figures for graph- 
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38 
36 
34 
32 
30 
28 
26 


x 
2 24 


0 2 4 6 8 lO 2 14" 16. We Neo “220824: 
X, or Xs 
FIGURE 1 
x? TEST FoR A 1:2:1 Ratio 
Z1, x2 and x3 are the observed numbers of progenies in the VV, Vv and » classes, respectively. 


Intersection of x1 and x2 (or xz and z2) outside an ellipse indicates significance at the P = .01, .05 and 
-10 levels of significance for sample size n = 50. 


TABLE 1 


Stopes, CENTERS AND Semi-DrAMETERS OF ELiipses DEFINING CRITICAL 
REGIONS FOR VARIOUS THREE-PART Ratios. 


Genetic Slope of minor 
Ratio* axis of ellipse Center Length of semi-diameters 
relative to 71 
axis ai L2 Ly Xe 
3:9:4 0.650 3n/16 9n/16 0.2890t 0.5616 
6:9:1 0.973 3n/8 9n/16 0.1710 0.6720 
3:9:3 0.721 n/5 3n/5 0.2710 0.5710 
3:9:1 0.895 3n/13 9n/13 0.1860 0.5970 
eal 0.883 3n/16 3n/4 0.1680 0.5580 
1:14:1 0.638 n/16 7n/8 0.1546 0.3800 
12 0.781 n/4 n/2 0.3000 0.5900 
eile 1.000 n/3 n/3 0.4080 0.4080 


*7. was written as the largest component of each ratio. This is somewhat at variance from usual 
genetic terminology. 
t+non-genetic 


te =V/n X22 
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ing, it is necessary only to substitute and x3,4 and to plot the ellipses 
as previously described. For three-part ratios not shown, it is necessary 
to substitute appropriate values for 7 , 72 , and 7; in (6), (7), (9) 


and (10). 
It would be possible to rewrite (4) in terms of the proportion 
p; = x,/n, resulting in corresponding changes in the above results. 


For a given genetic ratio and significance level, a family of ellipses 
could then be constructed with common center, the larger samples 
leading to the smaller ellipses. Such graphs would prove in some re- 
spects more convenient to use than the procedure outlined above. 
However, this convenience would in some instances, be offset by the 
requirement of the calculation of two of the p,; in order to determine 
the level of significance. The decision as to the better procedure de- 
pends on whether it is more desirable to have a minimum of separate 
graphs, but require supplementary computations, or to have more 
graphs with no computation necessary. Separate ellipses in either 
instance need not be constructed for each n as ellipses for 7 of similar 
size will be similar. 

We are indebted to a referee for pointing out a more elegant approach 
to the derivation of the characteristics of the chi-square ellipses and to 
Donald Richter for helpful comments. 
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GENERATING UNBIASED RATIO AND REGRESSION 
ESTIMATORS 


W. H. Witirams 


Bell Telephone Laboratories, Incorporated 
Murray Hill, New Jersey, U. S. A. 


1. INTRODUCTION 


Information collected on a concomitant variate is often used in 
finite sampling theory to create more precise estimators of population 
characteristics. This supplementary information is obtained in addi- 
tion to the characteristic under study and some aspects of it may be 
derived from sources other than the sample itself. It may be either 
qualitative or quantitative. For example, suppose that the variate 
under consideration in a sample survey is the number of dairy cattle 
per farm y and that at the time of the survey the number of grazing 
acres per farm z is also obtained. It may then be known from census 
data that the total number of grazing acres in the entire area is Nux 
and the mean per farm is yx . Analytically, we have a random sample 
of n pairs (y; , 4), 7 = 1, --- , m, from a population of size N and the 
population z-mean is known exactly. The problem is to estimate the 
population mean py . 

A general class of estimators designed to utilize this supplementary 
information includes ratio and regression estimators. These estimators 
are described in textbooks on the subject, see for example Cochran 
[1953]. Additional developments have been presented by Hartley and 
Ross [1954], Nieto [1958] and Robson [1957]. 

The two classical ratio estimators are the ratio of means estimator 
g = (G/®)ux and the mean of ratios estimator 7 = ux >o7_, 7:/n where 
g and & are sample means and r; = y,/z;. It is well known that these 
estimators are biased. The usual regression estimator is obtained by 
evaluating the least squares line of best fit y = y + b(« — «) at the 
point ux giving 9, = 7 + b(ux — #) asa regression estimator of py . 
This estimator is biased if the assumption of a linear model is not valid. 

The generation of some exactly unbiased ratio and regression esti- 
mators is discussed in this paper. Specifically, we classify an estimator 
as of the regression type if it is invariant under location and scale 
changes in x and if it undergoes the same location and scale changes 
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as the y variate. A ratio estimator has analogous properties but for 
scale changes only. 


2. DERIVATION OF UNBIASED ESTIMATORS 
FOR SIMPLE RANDOM SAMPLING 


To generate unbiased estimators, consider the following sampling 
procedure. At step one, select with equal probability one of all possible 
splits of the population into s mutually exclusive groups of size n/k, 
i.c., N = sn/k. At the second stage, select randomly without replace- 
ment k of the groups from the total number of groups s of that particular 
split of the population. This gives a sample of size n. 

Now consider the conditional distribution for a particular set of s 
groups. Attached to each of these groups there are characteristics” 
Go SEO Oa 1, Sr 5s wenere gj‘? and & are means of the n/k 
units in the group and 6" is as yet an unspecified function of the y and 
a of that group. Fora given split and a random selection of groups, the 


expectations of g° and e,7=1,-::,k, are wy and px respectively; 
that is, they are conditionally unbiased. Furthermore, 
(i -) 4 po - be -@ () 
SS k(k = 1) i=1 5 
is an unbiased estimator of Cov (6, #) where 6 = Die: b’/k. 


Hence if g = 9 + b(ux — #) then E(g) = uy — Cov (b, £) and 


k 
T= 9+ ue-a+(i-2) gia De-oe-9 @ 
is a conditionally unbiased estimator of wy . It is then unbiased un- 
conditionally. 

This approach is valid for any defined form of the coefficient bee 
T,, will remain unbiased. If 6‘ has a form which is invariant under 
linear x and y transformation (say least squares form) then T’, is classi- 
fied as a regression estimator. If b° = g‘/z (say), then 7, falls 
into the class of a ratio estimator. 

This procedure is used to generate the unbiased estimators; in 
practice a simple random sample would be drawn and to compute 7, 
it would be split randomly into groups, see Section 7 for an example. 
The latter operation is equivalent to the generating procedure which 


allows a particular split-sample to arise in (*) (N — n)!/[(n/k)°* 


1]t is assumed that this relationship is true in terms of integers. 
2Superscripts will be used to specify the groups. They will be used with parentheses when the 


reference is to the entire population of s groups and without parentheses when referring to the sample 
of & groups. 
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ways while splitting the simple random sample allows a particular 
split-sample to arise in only one way. The unbiasedness is preserved 
by either procedure. 

The argument is easily generalized to p auxiliary variates. 


3. SPECIFIC ILLUSTRATIONS IN SIMPLE RANDOM SAMPLING 


A form of interest is 


po? sae = : = ibe ar eensh (3) 


the least squares slope form. 

In this case 7, bears much resemblance to 7, and might be thought 
of as possessing an additional component which is required to compen- 
sate for possible bias in J, . This is not exactly true of course, because 
the first two terms of 7), are not exactly the two terms of 7, . However, 
in this case (3), it seems natural to make some remarks on the efficiency 
rey aad ane 

The variance of 7,, depends very much on the form of the b‘ 
coefficients. In fact, until the form of b‘” is specified little can be 
said about the variance of 7;,. One can imagine choices which would 
lead to poor efficiency indeed. However, 7’, in this case has coefficients 
in the least squares slope form and it is natural to ask how it compares 
with 7, when a linear model is assumed, for then %, has optimum variance 
properties. But with this assumption, g, also possesses unbiasedness 
and the advantage of 7’, is unbiasedness in situations in which % is 
not unbiased. However, one would like the efficiency of 7’, to compare 
favorably even in this linear model case. So by assuming a linear 
model and a normal z-distribution, it is easily found that V(gj,)/V(T;.) = 
(n — 2)(n — 6)/(n — 3)(n — 4), k = 2 andn > 6. This expression 
is less than one but approaches one as 7 gets larger and, for example, 
when n = 15, 25 is equal to 0.89 and 0.95. Thus we see that one does 
not lose all the efficiency brought about by the use of an auxiliary 
variate and that [V(T,) — V(G)l/V(%) is O(n"’). 

Furthermore, the role of k also depends upon the choice of the b“”. 
For example, in the spécial case of the previous paragraph, if the number 
of groups is regarded as variable, V(g,)/V(T,,) will be found to have 
a maximum at k = ~/n/3. Thus for this form of the b“’, the optimum 
number of groups is ~/n/3. Other forms of the b“ would yield other 
results. 

Another possible choice is 
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n/k n/k 


0 = 
7=1 


7=2 
In this form 7; is a ratio estimator and it is unbiased even if the linear 
relationship of y and x does not pass through the origin. But character- 
istically the variance will be inflated by such a relationship. 


Next, if db = 9/2 =r, T, will reduce to the form 
Nk—n 
= , - a a 1] — ry 4 
T TUX N(k 1) (¥ Fz) ( ) 


where 6 is denoted 7. It will be noted that when k=n, T, = y’, the 
unbiased ratio estimator presented by Hartley and Ross [1954]. The 
efficiency of this form of 7, has been examined in detail by Goodman 
and Hartley [1958] and Robson [1957]. Robson presents an exact 
variance formula for finite populations. 

Finally, consider b°? = r =. (k/n) Se te hp Sal 2 een 
6 =F = >.*., 7;/n which does not depend upon the particular split 
of the population. Now if, after substitution of this form into T;, , 
the estimator is averaged over all possible splits of the sample into 
groups of size n/k it will be found that the result is again the Hartley- 
Ross unbiased ratio estimator. This averaging process is indicated 
by a star, ie., T% . 

Other forms could, of course, be considered. 


4. STRATIFIED SAMPLING 


Since a bias may be magnified relative to the standard deviation, 
stratified sampling may perhaps be regarded as the most important 
application of unbiased estimators. Their separate use within strata 
requires exact knowledge of the population strata means but is straight- 
forward. We now develop a combined stratified estimator. 

Consider LE strata of size NG-, § = te ee within, ee 
and again consider the sampling in two stages. At the first stage select 
with equal probability one of the possible splits of each stratum into s 
eroups of size n,/k,t = 1, ++, L. Then N, = sn./k. At the second 
stage select k groups with equal probability and without replacement 
ah each of the strata, giving a sample of size n, in the ¢-th stratum, 

MP hee (i 


For a given split and a random selection of groups 
L L 
Gn = ye (N./N)j: and Z, = »3 (N./N)z: 
t=1 t=1 


are unbiased estimators of yy and ux respectively, where 9; and @; 
denote means of the i-th group in the ¢-th stratum. Also we can consider 
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a coefficient b{;’ which is as yet unspecified in form but utilizes the set 
of elements in the 7-th group of all strata. For example, 


L nt/k 


Ew —= Ge (tz — &,°) 


is an over-all slope estimator. 
Next we note that 


= 2d (N./N)9. = 2 G:./k (6) 


where g, is the mean of the n, observations in the ¢-th stratum 
(similarly for #,,) and finally that a conditionally unbiased estimator of 
Cov (6.: , £.:) is given by 


k 


(1 a NT ee Te a 1) d =. aA) CEP = BE 


Consequently, if g = gj. + 6.(ux — @.) then EY) = py — 
Cov (6., , €.,) and therefore 


Doss = Yet == be (ux Taal ees 
n il ee a ; = 
+(1-2) be Ye sn0 8) @ 


is a combined stratified unbiased estimator of yy . Note that since 
N, = sn./k, k/s = n/N. Nieto [1958] discussed the efficiency of the 
estimator (7) (for sampling with replacement) in detail. 
Again the generalization to p auxiliary variates is straightforward. 
As a specific illustration consider the case in which : 


Se = Tyee ee — es 
Then 7,.,,,; reduces to 
7 Nk Bcc) 
Tecst) = Fethx tie cay Ge - RAL (8) 
In the special case that VN, = N,n, = “for alltandk = 7,s = N then 
V-1 Os 
T x(t) = Faux + 2S (Ger =e Pilea) (9) 


which is a generalized Hartley-Ross estimator. 
Finally, we again consider an averaging of 7’, over all possible splits 
of the sample into groups of size n,/k, t = 1, --- , L. For this, the 
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Y . ‘ i Ci L w(t) 7 
coefficient is taken in the form BS? = 177 = Ye Wor, where 
ro? = (k/n,) D4 (y,/a;). Therefore, 


k L nt L 
Py vi /k = De (N./N) iD (y,/2) > a (N./N)F. 

al t=1 j=1 ua 
and some algebraic reduction will show that Tx:.1) averaged over all 
possible splits is equal to 

L Pi je Fhe 

es ms aif n N; (G ¥z FX) 
TFs eat] Os + (Ber vo ipGee) SF (1 = 2) 2 N? (n, = 1) ) (10) 
which does not quite reduce to a form similar in appearance to Equa- 
tion (8) and the Hartley-Ross estimator. 

As before other selections of coefficients will yield other unbiased 
estimators. 


5. MULTISTAGE SAMPLING 


We consider a population with N primaries of equal size M and the 
following sampling scheme. First select n primaries from the N avail- 
able with equal probability with or without replacement. Then select 
with equal probability one of the splits of each of the primaries into s 
groups of size m/k. Then with equal probability and without replace- 
ment draw k of the groups so that the sample size is ™ in each selected 
primary. 

Consider now the conditional distribution for a fixed set of primaries 
and a fixed split of the primaries into s groups each. Then by Section 4, 
Equation (11) is an unbiased estimator of Y, , the population mean of 
the n selected primaries. 


TsO Os B(x a ao (1 a | ea De (b' a b)(# 0) (11) 
where 


n  m/k 


f= Gam Sys, 9= 0) Do = Om YY ve 


and similarly for xz. The coefficient b“ is again arbitrary in form. 
Finally, the expectation of T,¢s over all possible primary selections 
is the average of Y, over all possible primary selections; this is uy and 
Ta) i8 unbiased in multi-stage sampling. 
Again the selection of the coefficients yields estimators of different 
types. For example, an unbiased ratio estimator of the Hartley-Ross 
type generalized to multistage sampling can be obtained. 
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6. VARIANCE ESTIMATION 


It is interesting to notice that the same two-stage sampling scheme 
can be used to form an estimate of the variance of T,. First assume a 
negligible n/N (or k/s) and a fixed set of uncorrelated groups. 7, can 
now be written 


k 
T= 9-H Le — ws) (12) 
and its conditional variance can be expressed in terms of the variances 
and covariances of the components in (12). Since 7, is conditionally 
unbiased this variance has expectation equal to the over-all variance. 
Substitution of unbiased estimators for each of the terms of the variance 
(plus some terms of zero expectation) yields (13) as an unbiased esti- 
mator of the variance of T, . 


1 Ee 
o(T;,) = Ti — ea 2 vy. (13) 


Although this procedure is unbiased it can be subject to high sampling 
error, particularly for small k. 


TABLE 1 


A SmmpLe EXAMPLE OF THE ESTIMATORS 


T2 
Sample Pairs in 
Number Sample 7 Split 1 Split 2 Split 3 Ts 
1 P,P2P3P.4 3.500 ff ey 6.667 6.500 6.778 
2 P,P2P3P; 5.250 8.917 8.250 7.917 8.361 
3 P,P2P3P. 7.500 11.000 10.167 9.667 10.278 
4 P,P2PsP5 8.750 11.917 10.250 9.917 10.694 
5 P,P2P.4P3; 6.500 10.000 8.667 8.500 9.056 
6 P2P3P4P5 7.500 8.167 7.667 7.500 7.778 
i P2P3P5P¢ 11.500 10.000 8.667 8.500 9.056. 
8 P2P3P.P 5 9.750 9.417 8.750 8.417 8.861 
9 P,P3PsP5 7.250 9.417 8.750 8.417 8.861 
10 PiP;3P4P¢ 9.500 11.000 10.167 9.500 10.222 
11 P,P3P;P¢ 11.250 bah 9 brf 10.250 9.917 10.694 
- 12 P3P4P3P¢ 13.500 NG 6.667 6.500 6.778 
13 P,P.PP 12.500 11.000 10.167 9.667 10.278 
14 P.P4P5P¢ 12.750 8.917 8.250 7.917 8.361 
15 P,P2P5P 5 10.500 13.167 10.667 10.500 11.444 


ee eee eS 
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7. NUMERICAL ILLUSTRATION 


To illustrate 7, a small population consisting of the six pairs 
P) = Gj a), 0 S12, acs On ge and 2. =— 0; 162 eee 
was completely examined. Table 1 presents the values of g, T, and 7% 
[using Equation (3)] for all possible samples of size four. For T, , each 
of the possible samples was split into two groups of two in all possible 
ways, and the value of 7, was computed for each. The three distinct 
values of 7, for each sample are presented in the table. The numbering 
of the splits within a sample is of course arbitrary. It is readily veri- 
fied that the average value of each of 7, T, and 7% is the population 
mean py = 9.167. Furthermore, the exact population variances of 
gj, T, and T% are 7.914, 2.281 and 1.886 respectively. 

As a second example, the six pairs (y; , z;) were taken as follows: 
(02); -(1, 3), 2, 5), 4,97; 14), Op 15). All possible samples of size 
four were drawn and for each sample %, y’, % , T, (for all possible 
splits) and 7* were computed. A summary of the computations 1s 
presented in Table 2. 


TABLE 2 


ILLUSTRATION OF RELATIVE EFFICIENCIES 


Estimator 
j y’ Ge Ti 1% 
Expectation 3.937 4.000 3.961 4.000 4.000 
Bias —0.063 0.000 —0.039 0.000 0.000 
Variance 0.120 0.233 0.027 0.033 0.022 
Mean Square Error 0.124 0.233 0.029 0.033 0.022 
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A BIOMETRIC THEORY OF MIDDLE AND 
LONG DISTANCE TRACK RECORDS’ 


Matcorm E. Turner anp ELpanor D. CAMPBELL 
Medical College of Virginia, Richmond 19, Virginia, U. S. A. 


1. Introductory Remarks. 


It has been often observed that a plot of average velocity against 
distance or time for world track records suggests a continuous monotonic 
underlying relationship. Several attempts to discern the nature of this 
relationship empirically by seeking linearizing transformations have 
been made with varying degrees of success. The purposes of this 
paper are to describe an attempt to explain the observed curve by a 
biometric theory and to consider the problem of estimating unknown 
parameters occurring in the theory. The theory is applied to world 
record data of 1959 and records which are below par are pointed out. 
The development is based on the work of A. V. Hill [1927]. 


2. Biometric Theory. 


Consider the situation in which a runner runs a given distance x in 
a certain time ¢ at a constant speed &, this constant speed not necessarily 
being the runner’s maximum speed. We further define: 


y, the total energy (measured as volume of oxygen) excess over 
resting required to run the given distance with the given con- 
stant speed, 

% , the greatest speed for which the energy (oxygen) required is 
not greater than the energy available for an indefinitely long 
period of time. Thus, at speed % the energy required is equal 
to the maximum amount of oxygen which can be brought into 
the body from the atmosphere, 

r, the maximum excess (over resting) rate at which oxygen can be 
brought into the body from the atmosphere, 

D, the maximum tolerable oxygen debt beyond which complete 
exhaustion is manifest. 

The total excess energy y is a function of the constant speed « and 


the time of the race ¢. We assume that the rate of consumption of 
energy (oxygen) dy/dt is a function of ¢ but not of t. Thus, 


1Read before the spring meeting of ENAR, April 1960, and based in part on a Ph.D. thesis super- 
vised by Robert J. Monroe and H. L. Lucas, Jr., North Carolina State College, 1959. 
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dy/dt = f(e) and y = f(t. 


The constant of integration is taken to be zero since no excess energy is 
required to run a race of zero duration. 

The function f(z) is of unknown form; however, we expect it to have 
at least the following characteristics: 


(1) f(z) is a non-decreasing function of @. 
(2) f(z) has lower-order derivatives which vanish at the equilibrium 
velocity Zo, 


(3) feo) = 7. 


The first characteristic has been experimentally demonstrated by Hill 
[1927]. The second characteristic, although not experimentally demon- 
strated, seems plausible by analogy with other physiological equilibrium 
phenomena. It often happens that in the neighborhood of a point of 
equilibrium (here there is an equilibrium between the energy being 
consumed and the oxygen being brought into the cells) that com- 
pensatory physiological mechanisms come into play. It may be that 
circulatory changes or viscosity changes (due to temperature changes) 
provide such compensation in the present case. Whether or not these 
mechanisms are plausible, it seems appropriate to require characteristic 
(2) in the absence of experimental fact since one is hard pressed to 
find any physiological example not possessing this characteristic. 
Characteristic (3) is implied by the definition of # and r. 

Perhaps the simplest function possessing these characteristics is 
the following: 


i@) =r+ b(é — &)*, £2 to. (1) 


Then we have 
y = rt + (4 — 4)'t. (2) 


Equation (2) van be used to describe the experimental data shown in 
Figure 11 of Hill [1927]. Hill measured directly the excess oxygen 
required by an athlete to run for given periods of time at constant 
speeds. This experiment, of course, does not correspond to actual 
contest conditions, but does serve to illustrate the relation (2), which 
was not given by Hill. The meaning of the relation is clear. If a 
runner runs at his own asymptotic speed @ , then the excess energy 
required is obviously proportional to the time run where the proportion- 
ality constant is 7; but, if he runs faster than @, , then the energy required 
will be increased monotonically with the divergence of his speed from 
the asymptotic speed. 
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Now, the speed cannot be increased indefinitely since there is an 
upper bound on energy available. This upper bound is given by 


) Pee A ee (3) 


For all but the shorter races the time lost in attaining optimum 
speed is negligible and it can be seen that the best time for a race can 
be obtained by maintaining a nearly constant speed throughout the 
race. It may be supposed that records are broken when a runner with 
a great amount of available energy is able to select that constant speed 
which equates the maximum available energy to the required energy. 
In this case 


ri + D=rt + 4 — &)*t, (4) 


which upon cancelling rt from both sides and rearranging gives an 
equation relating speed to time run. Thus, 


& = &) + (bt/D)™. (5) 


The constants # , D, b, and k are constants pertaining to the indi- 
vidual runner. D is the maximum oxygen debt the runner can tolerate 
and b and & relate to the mechanical efficiency of the runner’s body, 
ie., to the amount of work required to overcome the viscosity of the 
muscles. z, is a function of the rate at which oxygen can be gotten 
to the tissues. Relation (5) could be used to study the progress of an 
individual runner in his period of training; however, our interest at 
present is in characterizing the world record curve. Although a» , 
D, b, and k are personal parameters, it may be supposed that, for world 
record holders, <, and D approach the human maximum for all and 
that b and & are similarly close to the human minimum for all and that 
therefore it is appropriate to treat  , D, b, and k as constant parameters 
for all record holders as a first approximation. Equation (5) is a special 
case of the single process law described by Turner [1959]. 


3. Estimation Theory. 
For convenience we change the notation. Let: 
Y= .3) 6 = —1/k, 
a Xe 
B = (b/D) ”* = (D/b)'”. 
Then (5) may be rewritten: 


Y =a+ Bx’. (6) 
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Now, (6) cannot be expected to hold exactly for world records but 
it may be considered to be the limiting curve which is approached by 
the successive records in each event. Thus, for the actual observed 
records, we may write 


Y =e Be ee (7) 


where ¢’ < 1. Perhaps, the limiting curve (6) itself is not absolutely 
stable since the human species is possibly changing in its athletic 
capacities due to nutritional and genetic effects. However, these 
changes may be regarded to take place slowly as compared with the 
rate at which records are broken. 

At any point in time we will assume as a first approximation that 
the distribution of world records below its asymptotic value can be 
described by the truncated hyperbolic distribution? 


COS OR SESS I (8) 


If we substract a from both sides of (7) and take logarithms we 
obtain the linear form: 


log (Y — a) = log 68 + 6 log X +, (9) 


where « = log «’. Now it will be seen that the additive error ¢ follows 
the rectangular distribution: 


fe=l/p, =pses0: (10) 


Thus, the problem of estimation is reduced to a well known one: to fit a 
straight line when the errors have a rectangular distribution. Maxi- 
mum likelihood estimates are readily found by minimizing the maximum 
error (see Turner, [1960]). The procedure is as follows: 

(a) Choose a trial estimate of a and then plot log (Y — a) against 
log X. 

(b) The line with minimum maximum error will either (i) pass 
through one point such that all other points fall below and such 
that two points will have the same maximum error, or (ii) pass 
through two points such that all other points fall below and 
such that one point will have the maximum error. 

(c) In ease (i) the line will be parallel (and hence have the same 
slope) as a line passing through the two points with maximum 
error. The intercept is then found by passing a line with the 
same slope through the upper point. In case (ii) both slope and 


2The truncated hyperbolic distribution was chosen as being, perhaps, the simplest distribution 
producing high contact on the right and having a finite upper limit. Since the upper limit is the ebject 
of estimation, the lower portions of the distribution are relatively irrelevant. 
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intercept are found for the line passing through the two upper 
points. For either case the maximum error is readily ascertained. 
The choice of case (i) or (ii) is quickly made graphically and 
other determinations are made analytically. 

(d) The above three steps are repeated for other choices of a and 
then, from the relation between minimum maximum error and 
a, one interpolates for the maximum likelihood estimate of a. 
Repeating the above three steps one final time with the optimum 
value of a leads to maximum likelihood estimates for 8, 6, and p. 


The above procedure illustrates a general theorem concerning mini- 
mum maximum error approximation by polynomial equations due to 
Vallée-Poussin [1911]. 


4. Fit of Theory to World Track Records. 


The 1959 World records for all events between 400 and 10,000 
meters (including English distances of 440 yards to six miles) were 
used to fit the theory. These data are given in Table I. Shorter 
distances and longer distances bring into play mechanisms unaccounted 
for in the theory, such as delay due to starting, muscle stiffness, blisters, 
nutritional requirements, etc. 

Performing steps (a), (b), and (c) we obtain the following results: 


p 
a Minimized 
Maximum Error 


5.08 0.03249 
5.09 0.03190 
5.10 0.03130 
5.11 0.03143 
9.12 0.03183 


Parabolic interpolation between the middle three values in the above 
table yields the value @ = 5.103. Refitting the straight line gives the 


estimates 6 = —0.4451, 8 = 20.44, and 6 = 0.03115. We then have 
the estimated limiting curve. 
& = 5.103 + 20.44¢°**", (11) 


where the estimated asymptotic value of the speed is % = 5.103. The 
estimated limiting speeds have been calculated according to equation 
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(11) and are shown in the last column of Table I beside the actual 
record speeds. Both record and estimated speeds are also shown in 
Figure 1. Although no very long races have been included in the analysis 
it is interesting to compare the speed for 30,000 m., ¢ = 5.26, with the 
estimated asymptotic speed, #) = 5.10. 

Estimated times é may be readily found by dividing the distance a 
by the estimated speeds %. These values are shown in column (4) of 
Table I and the differences between these estimated limiting times and 
the actual record times are given in column (5) of the same table. The 
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OBSERVED AND ESTIMATED SPEEDS FOR VARIOUS RECORD TIMES 


extent of the agreement is almost startling. The largest discrepancy is 
seen to be just 14.8 seconds for the six-mile run. The time for the mile 
run appears to be relatively good, along with the 400, 5,000 and 10,000 
‘meter races. With the exception of the mile run the “English” distances 
tend to lag behind the corresponding “‘metric’”’ distances. Perhaps, 
this fact reflects nothing more than the popularity of the various dis- 
tances, but one is tempted to speculate that large improvements In 
those races which lag behind may be forthcoming. 
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ADDENDUM 


Since this analysis was completed Sweden's Dan Waern lowered 
the 1000 m. record to 2’ 17.8’’ and Australia’s Herb Elliott lowered the 
1500 m. record to 3’ 36.0’... These are both above the estimated opti- 
mum times for these distances and hence the analysis is not affected. 
However, the fit is substantially improved. 


SMALL SAMPLE BEHAVIOR OF SLOPE ESTIMATORS 
IN A LINEAR FUNCTIONAL RELATION? 


Martin Dorrr 
University of Maine, Orono, Maine, U.S. A. 
AND 
JOHN GURLAND2 


Mathematics Research Center, U.S. Army, University of Wisconsin 
Madison, Wisconsin, U. S. A, 


1. INTRODUCTION 


This paper is concerned with estimating the slope parameter in a 
linear functional relation between two variables which are not observ- 
able since both are subject to error. Such problems occur in many 
contexts. As a matter of fact, in virtually all applications of fitting a 
functional relation errors of observation occur in all variables. Very 
frequently we can neglect errors in the independent variables in com- 
parison with the errors in the dependent variable, but clearly we cannot 
always do so. In particular, it is often desirable to use a relatively 
inexpensive technique in place of a very costly one, where the results 
given by the two, apart from errors, are linearly related. For example, 
the evaluation of rocket grains by static test techniques is relatively 
inexpensive compared to the procedure of utilizing dynamic or flight 
testing. 

The following linear relation is assumed between the variables X, Y. 


Y=a-+ px. 
Neither X nor Y can be observed, but only x and y, where 
xr=X+e 
See Mos 
e and f are independent random variables with expectation zero repre- 
senting errors of observations; X and Y are the expected values of x and 
y respectively. When for each X,(Y;) we have a single observation 


z;(y;), we shall say that we have unreplicated observations on that 
variable; otherwise we have replicated observations. 


1Preparation of this paper for publication was supported in part by contract No. DA-11-022- 
ORD-2059; research supported in part by Air Force Office of Scientific Research, Contract No. AF 
49(639)43, : 

2On leave from Iowa State University of Science and Technology, Ames, Iowa, U.S. A. 
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In a previous paper [3] we considered the large sample properties of 
various estimators of the parameters a, 8. In the present paper we 
consider the small sample behavior of these slope estimators in order 
to compare them. First however, we take up the small sample behavior 
of slope estimators which do not require replicated observations; such 
estimators were discussed only briefly in our earlier paper. 

We have chosen to examine the bias and the mean square error of 
the various estimators, partly because of the innate interest which 
these properties hold and partly because they seemed most amenable 
to attack. In all the cases considered the square of the bias is small 
compared with the mean square error; consequently our findings con- 
cerning the latter are essentially valid for the variance as well. 


2. ESTIMATORS WHICH DO NOT REQUIRE REPLICATION 


The problem of estimating the parameters of a linear functional rela- 
tion when both variables are in error and there is no replication has long 
been regarded as an intractable one. In 1940 Wald [10] suggested that, 
when the number of observations n is even, one can divide the observa- 
tions into two equal groups and take as an estimator of B 


by = (Su, as 4) (ee ae 5 a): 
Ant gn+1 1 

He showed that by is a consistent estimator of 6 provided that the 

partition of observations into two groups can be carried out inde- 

pendently of the errors and provided that limit inferior of 


Cee te ae + X;,) —— (Xie oe a + X,) |/n 


is positive. If the errors in the x; are small enough so that partitioning 
the observations according to the magnitudes of the x; yields the same 
two groups as obtained by partitioning the observations according to 
the magnitudes of the X, , the first condition is satisfied. The second 
condition guarantees that the expectation of the denominator does not 
vanish. 

Nair and Shrivastava [8], Bartlett [1], and others [4], [9], have shown 
that, for certain spacings of the X, , more efficient estimators of 6 can be 
achieved by dividing the observations into three groups and omitting the 
middle group. In particular, Bartlett suggested that, when the number 
of observations is divisible by three, one should take as an estimator of B 


n gn n gn 
1 (Su- £ 0) /(Be- £2) 
ant ant 1 


This is a consistent estimator of 8 under the obvious modification of 
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Wald’s conditions. Bartlett showed that b,; is more efficient than by 
when the X; are equally spaced. 
Housner and Brennan [6] suggest that the estimator 


by = 2G — Dy./ dG — De, , 

where 7 now denotes the order number when the z’s are ordered according 
to increasing magnitude, will often be more efficient than any of the 
foregoing. In fact they give empirical results to substantiate the belief 
that by is more efficient than by when the X; are equally-spaced. It 
should be realized, however, that consistency of this estimator rests 
upon more stringent assumptions than those made by Wald and the 
other writers mentioned above. One must now assume the entire 
ordering according to the magnitudes of the x; to be identical with the 
ordering according to the X, . 

It is clear that all of the estimators mentioned above are members 
of the class bt = >> wiy:/ >.> w,«; , where 7 denotes the order number 
when the 2’s are ordered according to increasing magnitude and 
> w; = 0. One requires this restriction on the w; in order to ensure 
that b; be invariant with respect to translations of the coordinate axes. 
In our investigation of b; we shall make the assumption that the ordering 
according to the magnitude of the x; isidentical with the ordering accord- 
ing to the X;. If that is so, the w; are simply constants, and by, is the 
ratio of two linear forms in z; and y;. In some fields of research such an 
assumption might be unrealistic, but in the physical sciences it would 
ordinarily be regarded as quite reasonable. In any event, occurence 
of situations where the assumption is reasonable seem sufficiently 
frequent to justify a detailed investigation. 

We propose now to examine the bias and mean square error of b; as 
they relate, in particular, to the choice of the w;. We first set down the 
model and the assumptions which underlie the discussion in Sections 
3, 4, and 5 below. We suppose that 


Y,=a-6X;, ee a) a (3° 


where a and @ are unknown parameters while the X; and Y; are un- 
known constants. The X,; are assumed to be ordered according to 
increasing magnitude. We observe x; and y; , where 


Li; Xk te, 
y= Y,+h- 


The e; and f; are random variables, representing errors of observation, 
such that 
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(1) e; and e; are independent if ¢ ¥ j, 

(2) f; and f; are independent if 7 # J, 

(3) e, and f; are independent for all 7 and j, 

(4) E(f,) = 0 and E(f;) = o7, = Pa 

(5) Let-¢ = mun) |e Then Prob {| e; | > ze} = 0. 
That is to say, the e; have finite range, extending from — 3c to 
1¢. This is the assumption which ensures that the ordering 
according to the x; is identical with the ordering according 
to) the Aone 

(6) E(e:) = Oand E(@) = of = we, BE) = ws E(e) = we. All 
odd moments of the e; are zero. 

(7) >> w,X; #0. It is easily verified that for each of the estimators 
considered this is the case. 


It should be stressed that the e; and f; represent errors of observation 
only since the X; and Y, are constants, otherwise e; and f; would 
obviously not be independent. As for assumption (5), it also ensures 
that the estimators are consistent, since SS w; = 0. In practice it is 
conceivable that one might wish to relax assumption (5) slightly to 
permit a distribution with infinite range but with most of the probability 
concentrated in (—c/2, c/2). The extent to which one might relax this 
assumption is not considered in this paper but will be investigated 
subsequently. 


3. BIAS AND MEAN SQUARE ERROR OF by. 


The bias, B, of b, , may be obtained in the following way: 


B(b,) = BCD wwBl soa) 


Now 


ii 1 
Do wa; P+ do we; 


-L 3 (yp Gene es 
k=0 F P ; s Fs ; 
ens 


where P = >) w:X;. 
Consequently, 


il t 
E(saa) = 5 do (—1)'P“*E(>, we)” + D, 
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p=) BX wey /[1 : pe} 


It has proved sufficiently accurate for the purpose of this investiga- 
tion to take t = 5. Then 


Ae 5 t+ p(X we, +3 OD eee Dy 


i [ower /(r+ 34] 


, we have 


where 


where 


Since E(>> w.y;) = 
E(b,) = af eae + 3: HD we.) | + BP D. 
Therefore 
B, = abe He Dwi + Si (we DY ws + Byp 2b wit | BPD. 
In all of the cases studied it is easily verified that 


—1< >i we,/P <1. 


Consequently, for the cases we want to investigate it is possible to find 
simple bounds for D. In fact, 


1 


1 6 
big peers tm 


To see this, one need only realize that for negative values of >> w,e; 


o< Shey i eee oy 


while for positive values of >> w,e; 
O< De COWAl a8 210s) < Ds (w.e,)°. 


Consequently, 
ne E De (we) | < bee eee 1H( > wee,)° 
ee Wi 2C ye | w; | 
Ue 
=f HOD W,e;)°. 
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Therefore 


1 il 
7 = il 
0 < D < 5p7 1 | w; |e 
1 yD onic 


(Ue oS w; + 15psp2 Se DE WiW; a 152 2a =, ww; W;)- 


ix 
Thus we have the means for approximating the bias to any described 
degree of accuracy, and furthermore, we have a way of assessing the 


accuracy of any approximation. ; 
It is possible to obtain the mean square error, M, of b, , in essentially 
the same way as B; was obtained. The result is 


i |} 2 >, Wi + a (us >, wi + 3y5 yz wiu | 
2 3 2 2 
aE doe E > wit pi me wi) 


+3 Daly Det +32 D Dw | 


+ A, where A = re/pe 


and 
27 {, _ de delwel\r a je De] wi |)" 
o<ac{eS(i P nc ie P 
Mio Dw! a te DD | w: ie a 
- {ne ys w; + 15psMo eS. » WW; + 15s pS Ie WiW; We} « 


4. OPTIMAL CHOICE OF WEIGHTS. 
We should like now to consider two possible sets of w; ; namely, 


1. The set which minimizes the bias B; , and 
2. The set which minimizes the mean square error M, . 


In general, of course, the two sets differ. The exact specifications of 
either set does not appear possible in view of the complexity of the 
expressions for B, and M,. On the other hand it is possible to obtain 
very good approximate specifications. 

If one chooses the w; according to the scheme of Wald, or Bartlett, 
or Housner and Brennan, max,, | es w,e; |/P turns out to be consider- 
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ably less than unity, even for values of n as small as four; this is a 
consequence of assumption (5) of Section 2. This leads one to conjecture 
that max | >> w,e; |/P is small for any reasonable choice of the w;—in 
particular, one would expect it to be small for good choices, like that 
which minimizes the bias or the mean square error. If this is the case, 
one would expect the bias and the mean square error to be determined 
primarily by the term of order P~’ in the expression for each; conse- 
quently, minimizing that term should approximately minimize the 
bias or the mean square error, as the case may be. Now this term in 
the expression for the bias is u.8P~” >> w? , whereas the corresponding 
term in the expression for the mean square error is .(8” + )P~? ie 
Obviously both of these terms are minimized by the same set of w,; , 
which is easily shown to be given by w; = (X; — X)C, where C is 
any arbitrary constant unequal to 0. 

Up to this point we have proceeded heuristically; we should now 
like to show with the aid of some examples that retention of terms in 
P~* makes only a trifling difference in the weights and far less difference 
in the bias itself. 

As the first example, we consider four points: X, = —3c/2, 
X_, = —¢/2, X,; = c/2, and X, = 3c/2. We take u, = 34(c/2)? and 
us = 4(c/2)* which are the moments of the rectangular distribution on 
the interval (—c/2, c/2), which is the maximum possible range conso- 
nant with the assumptions of Section 2. We might equally well consider 
any other admissible distribution on this finite range; the same tech- 
nique would apply; and the results would be substantially the same. 
The condition above gives as the weights w, = —3, w, = —1,w; = +1, 
ws, = +3 (or any multiple of these, of course). Minimization of 


1 4 2 
B, = 5| jh Hs Dwi + pi (Ga eon, > De ut | 
gives for the w, 


Wh —3.044, Wz, = ==. W3; = stale, Wy = +3 .044. 


This introduces a relative change in the bias AB,/B, of 9 x 10°’. 

As a second example, we consider the four points X¥, = —5c/2, 
X, = —c/2, X; = ¢/2, and X, = 5c/2, with pw. and p, just as in the 
previous example. The condition gives 


w, = —5d, WwW, = —1, w, = +1, Ws = +5. 


Minimization of 


B= epee Dut ie Dut +38 D Dw | 
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gives for the w; 
w, = —5.034, Ww, = —I1; W, = pls w, = 5.034. 

This introduces a relative change in the bias AB,/B, = 3 X 10°" 

It will be observed that the perturbations produced both in the 
weights and in the bias by the term in P~ are smaller in the second 
example than in the first. That is exactly what one would expect, for 
although the spread of the X, has increased in the second example, 
the range of the e; has not changed. This has the effect of decreasing 
max >, w,e;/P, thereby decreasing the importance of the term in 
p~ relative to that of the term in P-’. One might examine the mean 
square error in similar fashion. It is clear that the approximate minimi- 
zation of bias and mean square error given by choosing w; = X; — x 
is a very good approximation, and the estimator bo which employs 
these weights can for all practical purposes, be regarded as having 
minimum bias and minimum means square error. We shall refer to 
this choice of weights as optimal and to the estimator 


pS CX Ryd/1 Do ee X)=x;] 


as the optimal (ratio-of-linear-forms) estimator. 


5. COMPARISON OF VARIOUS WELL-KNOWN ESTIMATORS 
WITH THE OPTIMAL ESTIMATOR. 


In general the spacing of the X; is unknown, and the optimal esti- 
mator is therefore not obtainable. It is customary to use the weights 
of Wald, of Bartlett, or of Housner and Brennan. The corresponding 
estimators will henceforth be denoted by by , be , bx respectively, and 
the corresponding biases and mean square errors will also carry these 
subscripts. For example Bs denotes the bias of Bartlett’s estimator. 

When the X, are uniformly spaced, b, and bo are identical, which 
is to say that the Housner-Brennan estimator is optimal for this spacing. 
For any other spacing of the X, neither by , bs , nor by is optimal; the 
question is, how far do they depart from optimality. 

To answer this question we have studied the bias and the mean 
square error of these estimators for n = 6, 12 and for various types of 
spacings of the X, , which may be classified as follows: 


1. Symmetric spacing: --- , —5°p, —3°p, —1%p, l%p, 3°p, 
5p, tora 2.0) 

2. Symmetric spacing: —1°p, —2°p, —3°p, --+ , 3°p, 2°p, 1°p 
for w < 0 

3. Asymmetric spacing: 0, 2(1°)p, 2(2°)p, 2(3°)p, --- for » > 1 
and 0 << or 4 


4. Asymmetric spacing: —1°p, —2°p, —3°p, --- forw < 0. 
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We have taken the values of p to make the ranges of the selected X, 
values approximately the same, ensuring at the same time that condition 
(5) of Section 2 holds. In addition we have taken 


=a), m=5G)) w=4( 

ea RO) RES) Pa e NS) 3 
which coincide with the second, fourth, and sixth moments of a rec- 
tangular distribution having the maximum range consonant with the 
assumptions of Section 2. As indicated in Section 4 there is no serious 
loss of generality in confining our attention to a rectangular distribution 


of errors. 
Tables 1 to 5 present, forn = 6 andn = 12, 


1. The bias of the optimal estimator. 

2. The ratio of the bias of by , bg , and b, to the bias of the optimal 
estimator. 

3. The ratio of the mean square error of by , b; , and b, to that of 
the optimal estimator. 


The values cited in the tables are correct to within one unit in the last 
place given there. Results for other values of n can be found in Dorff [2]. 


6. LEAST-SQUARES-TYPE ESTIMATOR 


This section deals with bg = [D\(x, — @)y;]/ [>> (x; — #)x; , which 
is the estimator one would obtain if he minimized the sum of squares 
of deviations in the vertical direction, simply ignoring the fact that the 
x; are random variables. It is well-known that bg is not a consistent 


TABLE 1 


Maximum VALUE oF Ba /f. 


Values of w 
n 2.0 1.0) 0.5 —0.5 —2.0 
Symmetric Spacing 
6 . 0002 .0048 .0012 . 0004 . 0008 
12 .0000 . 0006 .0001 .0001 .0000 
Asymmetric Spacing 
6 . 0002 — .0014 .0006 .0000 
12 . 0000 — . 0002 .0000 .0000 


’For w = 1.0 the pattern of spacing presented would be symmetric; consequently no value is 
cited here. 
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TABLE 2 


Bras Ratio FOR SYMMETRIC SPACING 


Values of w 
Ratio 74. Ad) 1.0 0.5 —0.5 —2.0 
n=6 
Br/Bo 1.06 1.00 1.04 1.08 1.10 
Byw/Bo ee 1.30 1.09 1.06 1.74 
Bzg/Bo 1.2 1.10 1.14 1.25 1.38 
Be/Bo —3.00 —2.97 —2.90 —2.99 —2.99 
ij, = NP 
Bu/Bo 1.06 1.00 1.04 1.06 1.47 
Byw/Bo 1.78 1.32 Vegi) sala 2.93 
Bz/Bo 1.28 1.12 105 1.26 Zaile 
Bo/Bo —9.00 —9.00 —9.00 —9.00 —9.00 
TABLE 3 
Bras Ratio FoR ASYMMETRIC SPACING 
Values of w 
Ratio 2.0 0.5 —0.5 —2.0 
n=6 
Br/Bo 1.08 ae 0) iWeAtre 1.62 
Byw/Bo 1.41 1.58 1.69 273i 
Bz/Bo 1.19 i ae 1.34 2.03 
Bg/Bo —3.00 —3.00 —3.00 —3.00 
nm = 12 
By/Bo 1.08 1.09 2 eta! 
Bw/Bo 1.438 1.56 2.08 5.24 
Bp/Bo 1.20 Pat 1.61 3.66 
Ba/Bo —9.00 —9.00 —9.00 —9.00 
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TABLE 4 


Mean-Square-Error Ratio FoR SYMMETRIC SPACING 


Values of w 


Ratio 2.0 1.0 0.5 —0.5 —2.0 
n=6 

My/Mo 1.06 1.00 1.04 1.08 if At) 

My/Mo LS ileal 1.09 1.06 eZ! 

Ms3/Mo eo 1.10 il). it 1.26 1.38 

Mg/Mo 0.99 —! 0.99 0.99 1.00 
n = 12 

Myz/Mo 1.06 1.00 1.04 il Oy iL ere 

My/Mo 1.78 e352 i) ila il all 2.93 

Mz/Mo 1.28 Tee? 1 is eG 2.138 

Mg/Mo 1.00 OL 1.00 1.00 1.00 


‘Because of slow convergence of our approximation techniques for this combination of parameters, 
we have no reliable value to cite here. 


TABLE 5 


MEAN-SQUARE-ERROR FOR ASYMMETRIC SPACING 


Values of w 
Ratio 2.0 0.5 —0.5 —2.0 
8) 
My/Mo 1.09 ional tL 1.62 
My/Mo 1.41 1.58 1.69 2.81 
Msz/Mo 1.19 1.27 1.34 2.03 
Mg/Mo 1.00 0.98 1.00 1.00 
n= V2 
My/Mo 1.08 1.09 132 De oil! 
My/Mo 1.43 5G. 2.08 5.24 
Mz/Mo 1.20 Ni 7/ GL 3.66 
Mg/Mo 1.00 1.00 1.00 1.00 
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estimator of 8; however, it is conceivable that it might have desirable 
small-sample properties. We shall now investigate the bias and mean 
square error of bg , making the same assumptions as in Section 2. 
Tables 2-5 cite values of Bg/Bo and Mg/Mo correct to within one unit 
in the last place given there. It will be observed that Bg/Bo is approxi- 
mately —(n — 3) while Mg/Mo is nearly unity. 


7. DISCUSSION OF ESTIMATORS WHICH DO NOT 
REQUIRE REPLICATION 


We should like now to compare the estimators discussed in Section 2 
with a view toward making some practical suggestions for workers 
who are actually concerned with fitting straight-line relations. 

Generally speaking b, is clearly superior to by but there are some 
situations when such is not the case; we found that by had lower bias 
and lower mean square error than bz when the X, are symmetrically 
spaced, with w = 0.5 or w = —0.5. However, in almost every case 
investigated, by proved superior to by and to bz . This, of course, is 
hardly surprising. The use of by presupposes through the ability to 
order all the observations more information available to the experi- 
menter than that of by or bg , and estimators based upon more complete 
information are typically more efficient than those based upon partial 
information. We did, nevertheless, find instances where bw proved 
superior to bg and by. One of these is n = 6, ow = —0.5, symmetric 
spacing. For asymmetric spacings corresponding to these values of 
n and w, by was found decidedly superior to the others. 

As the comparison of by, and bg is slightly more involved, we should 
like to first compare bo with bg . The conclusion seems inescapable 
that when by can be obtained it is preferable to bg for two reasons: 


1) bo has much smaller bias than bg , while its mean square error is 
essentially no greater. 
2) bo is a consistent estimator of 8, whereas bg is not. 


The difficulty, of course, is that bo cannot usually be obtained, inas- 
much as the spacing of the X; is unknown. 

However it appears that b, is a reasonable substitute for bp when- 
ever the X, possess no marked skewness and are not bunched excessively. 
When such is the case we see that neither By,/Bo nor My/Mg will 
ereatly exceed unity, while By/Bo will be approximately —1/(n — 38). 
Thus, when using b, in preference to bg , one stands to do much better 
with respect to bias and very little worse with respect to mean square 
error. 


One could roughly assess the skewness or bunchiness of his data by 
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plotting the x; versus 7 on log-log paper; the slope of the line which 
best fits the points is then w. Ordinarily this should not be necessary; 
in most cases one could decide whether there was evidence of skewness 
or bunchiness by merely looking at the data. 


8. ESTIMATORS WHICH REQUIRE REPLICATION. 


We consider now various estimators of @ suitable when replicated 
observations are available. We suppose that 


Vow BX es = 1, Dye. 5 n; 
just as in Section 2, but now we have observations 
te =X, +6, 
Yo = 1G oP lee, 
where? = 1,2, +-- ,r(r > 1); 


e;, and f;, are random variables, representing errors of observation, 
such that 


1) e,, and e;, are independent unless 7 = j and? = u. 

2) fi, and f;, are independent unless 7 = j and t = u. 

3) e;, and f;, are independent for all 2, 7, ¢, wu. 

4) Let ¢ = min; | X;,, — X; |. Then Prob {| e;, | > }r=20) 

5) E(éz,.) = ue, E(e,) = wm, E(e,) = us. All odd moments of 
the e;, are zero. 

6) Prob {| f;. | > 3h8c} = 0. That is to say, we now restrict fj, 
to a finite range (—3hGc, $hGc); the symbol h is defined by this 
relation. 


7) E(fi:) aoe ee h’B’ us ) Efi.) SSA lime! h'B' us ’ E(fi.) a eras h°B* 6 : 


All odd moments of the f;, are zero. (These assumptions would 
be satisfied if the distribution of the f;, differed from that of 
the e;, only in its range.) 


In our previous paper [3] we considered the following estimators 
of 8, all of which are consistent provided that their denominators have 
non-vanishing expectations: 


Bay en Wi 
by a ‘Bey (bs a Was) be cai (Bie a: We) (Bw ’ bs = jr a B 
where 
2 _ . 
lee = ) ? Wiz = eT pre 1) SS ye (tis Le) d 
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ke 2 = 1 . — . z 
By Sy Oe ea Pe Dare ye 


By =—— DL. — Yi. — Y.); 
and sgn B = +1ifB > 0 
—1if6 < 0. 


We have applied the techniques of Section 3 to the study of the bias 
and mean square error of these three estimators for comparison with the 
bias and mean square error of bo = (oy. See Ny MVD (kX - X)z,). 
We found that b, , 0, , bs and by have approximately the same mean 
square error but that their biases can differ greatly. In fact, when the 
errors are sufficiently small B:/Bo = 2, B./Bo = 1 — i, BY Bs = 
(3 — h’)/2. Thus, the absolute bias of b, is smaller than that of 6. 
whenever h > V3 : the absolute bias of b; is smaller than that of b, 
and b, whenever 4/5/83 “how 7. Now k = vise. and since 
y>/p> is ordinarily unknown, we are unable to decide whether 4/5/3 < 
h < +/7 or not. This would seem to rule out b; as a useful estimator 
of 8. On the other hand in many problems it might well be possible 
to decide whether h > +~/3 or not. Inasmuch as }, and 6, have approxi- 
mately the same mean square error one could then use the bias as a 
criterion for choosing between the two: ie., 


When p’ < role use 0, ; 


When p > ratte use Do. 


It is interesting to note that analysis of the asymptotic variance of 
b, and b, in our previous paper [3] led us to recommend that 


when § <»/p., use 0,, 
when 8 > ¥a/p25 Use. bo 


so that the large sample approach based on variance and the small 
sample approach based on bias lead to somewhat different conclusions. 

For values of h larger than ~/2, bo would be preferable to b, , but 
unfortunately bo is not usually obtainable. We could, of course, use 
by , bw , or bg in lieu of b, at some sacrifice of efficiency, but with a 
corresponding reduction of the magnitude of the bias. If one knew 
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something about the spacing of the X; , this might be a desirable way 
to obtain an estimate of 8, but if nothing is known about the X, , it 
seems advisable to use either b, or b, . In any case the use of b, or b, 
does not require any assumption about the possibility of ordering the 
observed values correctly in contradistinction to the estimators 
by, bw,bg,0rbo. 


9. CORRELATED ERRORS 


In some investigations it would not be reasonable to assume the 
errors e; and f; to be independent; one should properly take account of 
correlation in such cases. The correlation of errors greatly increases 
the complexity of the problem and an examination of small sample 
properties for such situations has not been included here. 

We have, however, examined in [2], [3] the asymptotic variance for 
consistent estimators of 8 when the errors are correlated. In this case, 
the estimators b, , b, , and b; are modified as follows in order to retain 
the property of consistency: 


b, = (Bes Pa Weis ai es be = (Bie ae Wa (ee ie: Was 
bs = V(@,, —W,)/(Bezs — W..) sen 8. 


Essentially the same conclusions were obtained as in the case of inde- 
pendent errors: 


1. When 8” < y2/p2 , use by , 

2. When §” > »2/u2 , use bp , 

3. b, is advantageous for only a narrow range of values of the 
parameters and is therefore not likely to be of much interest. 


10. CONCLUSIONS 


In this paper we have considered the problem of estimating the 
slope of a linear functional relation when both variables are in error. 

When replicated observations are made a number of estimators are 
available, all of which have approximately the same asymptotic variance 
and mean square error, but study of the bias enables us to choose 
among them when we have partial information concerning the 
parameters. 

When there is no replication of observations and the errors are 
suitably restricted, there exists a class of consistent estimators which 
are ratios of linear forms in the observations. When partial information 
is available concerning the spacing of the X; there are rational grounds 
for choosing a suitable member of the class. 
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ADRIANUS J. VAN WOERKOM AND KEEVE BRODMAN 


Cornell University Medical Center 
New York, New York, U.S. A. 


INTRODUCTION 


In recent years, several methods have been proposed for making 
medical diagnoses by machine (Ledley and Lusted [1959], Crumb and 
Rupe [1959]). A method devised by Brodman et al. [1959, 1960] has 
been used to program a high-speed electronic computer for making 
presumptive medical diagnoses using only information relating to the 
age, sex, and responses of patients to a standardized health questionnaire. 

The method assigns patients to none, one, or several of 60 selected 
disease categories. The 60 diseases most frequently diagnosed by 
hospital physicians in men and the 60 diseases in women were chosen 
for study. The method was developed with data referring to 5,929 
consecutive adult white patients (2,718 men and 3,211 women) ad- 
mitted to the outpatient departments of The New York Hospital, a 
large general hospital, during the 18-month period beginning July 
1, 1948. It was tested with data referring to 2,745 consecutive adult 
white patients (1,280 men and 1,465 women) admitted during the 
12-month period beginning January 1, 1956. 

Each patient’s symptoms were elicited through a printed form, the 
Cornell Medical Index-Health Questionnaire (CMI). The CMI was 
devised to collect diagnostically important elements of the medical 
history given by general medical patients, without expenditure of a 
physician’s time. Solely with these data, a physician can often correctly 
predict which diseases will be found in subsequent examination (Brod- 
man et al. [1951]). 

Additional data abstracted for analysis from the hospital records 
include each patient’s sex and age, along with the diagnoses made by 
hospital physicians after eliciting a history and performing physical 
and laboratory examinations. These diagnoses for the 1948-1949 data 
were coded according to the U. S. Public Health Service M anual for 
Coding Causes of Illness [1944] and form the standard against which 
the method is evaluated. 


1Leo S. Goldstein and Albert J. Erdmann Jr. participated in this study, which was carried out at 
the Department of Medicine, New York Hospital-Cornell Medical Center with the support of a Public 


Health Service Research Grant. 
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The problem may be phrased in the following terms: Patients in 
the sample may belong to a number of specified disease categories. 
Patients were assigned by physicians to disease categories with a low 
incidence of false positives (incorrect diagnoses) but were not assigned 
to every category to which they belonged. Each patient has a number 
of attributes (complaints on the CMI). These attributes may or may 
not be correlated with each other. No one-to-one relation exists be- 
tween categories and attributes. The problem is to assign patients to 
categories solely by means of attributes indicated on the CMI, with a 
maximum of correct and a minimum of incorrect assignments. 

There are two obvious approaches to the solution of this problem: 


1. The use of the conditional probabilities of a patient’s having 
an attribute, given the category to which he belongs. From these, 
using Bayes’ theorem, the conditional probability can be found for 
the patient’s belonging to a specific category, given the attributes 
(Feller [1957], Chapter 5). 

2. The use of significance values or weights for the attributes with 
respect to the categories. 


CONDITIONAL PROBABILITIES 


Let P, be the probability or relative frequency of attribute A, in 
the total sample, and p;, the conditional probability of the attribute 
A, given the category D; . Then 


De Pr Ay De (1) 


If n,; is the sub-sample size of category D; and N the size of the 
total sample, then 


PrCD;) = 1,7N- (2) 

The conditional probability for a group of attributes A,, , --- , Ax, 1s: 

Pr (Ay: Pate A,, | D;) = Pr (Ax, D;) Pr (Ax, D;) oe Pr (A,, | Dye (3) 

Using Bayes’ theorem, the probability of the patient’s belonging to 
category D; given the attribute complex A,, , --- , A;, is: 


Pr (D; | A, Chichewa Pr (Ax, | Deere er CAs | Diyebe (D;) 
Spray Dee ee ens 


(4) 


The expression (4) for Bayes’ rule will have to be adjusted for correla- 
tion between the attributes. As can be seen from Table 1, the sample 
sizes of the 60 selected diseases are generally quite small. These sample 
sizes are of the order of 30, which makes the measurements of the 
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TABLE 1 


SAMPLE Sizes or 60 DisEASES IN MEN AND WoMEN. 


Sample Men Women 
Size N N 
10-19 16 8 
20-29 13 12 
30-39 8 14 
40-49 6 6 
50-99 10 12 
100-149 6 5 
150-404 1 3 


correlation coefficients uncertain. In Table 2 a tabulation is given for 
the correlation coefficients for the symptoms with higher proportionality 
in the disease sub-samples than in the total sample. The coefficients 
are reduced to unit variance. Only 22 percent of these correlations are 
more than twice their standard error. 

In general, a patient has more than two symptoms and multiple 
correlations will have to be considered. Because of the small sub- 
sample sizes, these multiple correlations, either directly observed or 


TABLE 2 


DISTRIBUTION oF ITEM INTER-CORRELATIONS ReDUCED TO UNIT VARIANCE, 


Correlation /Variance N 
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TABLE 3 


NuMBER OF DISEASES DIAGNOSED PER Patient (MAXIMUM OF 3). 


Number of N % 
Diseases Men Women Men Women 
1 1456 1635 54 51 
2 646 811 24 25 
3 616 765 22 24 


derived from the covariances, will be more uncertain than the zero 
order correlations and their use could lead to a large amount of bias. 
In view of this uncertainty and the tremendous amount of work in- 
volved, the measurement of all the correlations needed for the proper 
application of Bayes’ rule is not justified. 

Of particular interest with respect to the use of Bayes’ rule, many 
patients have more than one disease. In Table 3 a distribution of 
diagnosed diseases is given for the 1948-1949 New York Hospital 
population. 

The presence in a patient of more than one disorder considerably 
complicates a method using conditional probabilities. Symptoms 
claimed by a patient with a particular diagnosed disease may be related 
to other and often undiagnosed diseases. Due to a random distribu- 
tion of most diseases with respect to each other, the co-existence of 
diseases in a patient will not seriously obscure the cluster of symptoms 
having substantially higher proportionality within a disease category 
than in the total sample, nor will the co-existence of diseases seriously 
distort the correlations of these symptoms within the disease category. 
Correlations between symptoms which have an incidence within the 
disease category similar to the incidence in the total sample, however, 
may be due to other diseases. 

Some of these difficulties could be avoided by studying disease 
complexes rather than single diseases. It is evident from the sample 
sizes of single diseases as shown in Table 1, however, that the study of 
disease complexes in the sample available would not have been possible 
because of the small sub-sample sizes for these complexes. Even in a 
sample much larger than the one available, compounding of error 
because of false negatives would still be uncontrolled. 

The effect on the conditional probabilities of the presence of diseases 
other than the one being studied would be partially eliminated by using 
only those attributes which have a substantially higher incidence within 
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TABLE 4 
HYPOTHETICAL SAMPLE FOR CONDITIONAL PROBABILITIES. 
D, Dz, 
n 90 10 
A; 33% 60% 
As 33% 60% 
n(A,Ao) 9 4 
Pr(D|A,A2) 9/13 4/13 


the disease category than in the total sample. This approach would 
also eliminate those symptoms which occur less frequently in the 
disease than in the total sample. (In clinical practice, these symptoms 
are given little diagnostic weight.) 

Another objection to the use of Bayes’ rule is that the conditional 
probabilities for a disease, given a set of attributes, is directly propor- 
tional to the sub-sample size of the disease category. (The medical 
profession first considers diseases with a high incidence but does not 
make a diagnosis on this basis.) 

If the data available are consistent and complete, Bayes’ rule will 
give a mininum number of wrong assignments. If the data are incon- 
sistent and incomplete, as in the sample studied, there may be many 
cases of wrong assignments and methods other than Bayes’ might be 
more effective. Referring to Table 4, if the 9 cases in D, with attributes 
A, and A, were false negatives in the data with regard to D, , the 
hypothesis that all patients having A, and A, belong to D, will be 
more effective than the use of Bayes’ rule. This hypothesis can be 
tested only by re-examining hospital records or re-calling patients. 

Notwithstanding the objections raised, the conditional probability 
method, with the modifications described above, was tested with the 
1948-1949 sample of male patients. The results showed a number of 
expected inconsistencies. Nearly always, those categories with the 
largest number of attributes with high relative frequencies were favored. 
In addition, categories with small sample sizes and with a small num- 
ber of attributes of high relative frequency (that would have been 
identified by a physician in many cases) failed to get probabilities high 
enough to be considered. 

The most reliable information in the data is the diagnoses made by 
physicians after they have examined the patients. The one thing that 
can be established with minimum error from the data studied here is 
the significance of an attribute with respect to a disease. The con- 
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ditional probability approach fails to make use of this information. 
It was found from clinical experience and from noting the relative 
frequencies or proportionalities P, and p;, that a disease category is 
characterized by a cluster of attributes each with a much higher pro- 
portionality in the disease category than in the total sample. If a 
patient’s cluster of complaints is similar to the cluster of complaints of 
the average patient in that disease category, he can be assigned to the 
category. This approach is discussed in the following section. 


SIGNIFICANCE VALUES 


Let the variable S/, be given as 
Si, = ni (pir — Pi)/V miP 1 — Py), (5) 


where the notations are as before. It can be shown that in a sample 
where the attribute A, is randomly distributed with probability P, , 
the variable S/, is approximately normally distributed if n;P,(1 — Px) 
is not too small. However, according to Feller {1957 p. 170], the error 
committed in replacing the binomial distribution by the normal is 
surprisingly small, even for values of n;P,(1 — P,) of the order of 1.5. 
The comparison between the two for the probabilities of the variable 
between two boundaries is by far not as good, but this is of little conse- 
quence in this case. Then in the sub-sample of size 7; , the probability 
of observing the value S/, by chance is proportional to the expression 
exp (—S/?). The observed value of S/, in the sub-sample of category 
D, therefore expresses a measure of significance in the usual sense. 
Because the constant ; occurs in the significance values for all attributes 
with regard to the category D; , this factor can be dropped from the 
expression. Then, 


= (pa — P)/VPAL — P) ~ Qin — Pid)/ VP? - (6) 


The term (1 — P,) varies between .99 and .46 with nearly all values 
around .80 with the square root equal to .9. By ignoring the term 
(1 — P,) an error is made of approximately 10 percent. This error is of 
little consequence because of statistical inaccuracies in the other 
parameters. 

Let all attributes A, for which S/f > 2 be called significant attributes. 
If the complex of significant attributes contains more than one attri- 
bute, >> S//? represents a measure of the joint probability of the 
attribute complex and thus the significance of the complex. The signifi- 
cance value of the complex should also include the correlation between 
the attributes. As mentioned before, however, in the sample under 
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study these correlations are generally insignificant and a linear model 
can be used. 

The variable S// measures not only the significance of the attributes 
but also their correlation with respect to the categories. The observed 
coincidence of category D; and attribute A, is equal to (n;/N)pi, and 
the product of the expected values is (n;/N)P,. The co-variance is 
then equal to 


(n;/N)pi. — (n;/N)P. = n;/N(pu. — P.). (7) 


In order to reduce (7) to the correlation coefficient, the expression 
has to be divided by the square root of the product of the variances of 
n;/N and P,. The factor n;/N is constant for all attributes in the same 
category and therefore can be eliminated. The variance of P, depends 
on the universe from which our sample is drawn. If the universe has a 
Poisson distribution the value is equal to P, ; if the universe is binomial 
the value is equal to P,(1 — P,). 

The resulting expressions 


(pin —P)/VP. or (py — P)/VPL — Pi) 


are identical to S4f (6). 
The following form was adopted for S;, : 


Sie = (pn — P)/2VPil — 1, (8) 


where the factor 2 in the denominator yields a convenient scale. A 
minimum value of 3 was originally set for \/P, . Further study of 
the data indicates that setting a minimum value for this is unnecessary, 
and that all negative values can be set to zero. 

If the attributes A,, , --- , Az, are present in a patient, the corre- 
sponding sums 


are called the scores. The scores for a patient are normalized by divid- 
ing them by the mean score for the entire category. This normalization 
compares the patient to the average patient. If the sample sizes had 
been kept in the variables S,, they would have cancelled out at this 
point. 

It cannot be assumed that the score as indicated in formula (9) 
will give optimal results. The score is essentially a measure of the 
probability of the chance existence of any sub-sample with the same 
relative frequency p;, for the attributes Ax, , --- , A,, a8 in the sub- 
sample of category D; . When the score is high this probability is low, 
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that is, a high score indicates a small probability for the patient’s not 
belonging to the category. Had the linear sum of the significance 
values S;, been used to determine the score, small changes in the 
significance values would be reflected by large changes in the probabili- 
ties. This would make the score a bad indicator of the probabilities. 
If powers of S;, higher than the second were used, the score would be 
a bad indicator for the opposite reason. 

Various models (linear sum, sum of squares, sum of higher powers, 
etc.) may be compared quantitatively only if the distribution of scores 
in both the restricted and the category sub-samples are known. By 
restricted sample is meant the total sample minus the category sub- 
sample. These distributions could not be obtained for the sample 
studied, because the restricted sample contains patients who really 
belong to the category sub-sample but have not been assigned to it in 
the hospital (false negatives). Some of the disease categories, further, 
are significantly correlated. These correlated category sub-samples 
must be removed from the restricted sample in order to get the desired 
distribution. Removal of false negatives and adjunct category sub- 
samples is impossible for a large sample. 

Physicians, when making a diagnosis, consider the patient's age. 
Table 5 shows the age distribution by decades of the 15 diseases diag- 
nosed most frequently in men and in women in the 1948-1949 popula- 
tion. The relative frequency of a disease category is a function of age. 
The use of these relative frequencies in the sense of relative probabili- 
ties is not consistent with a model based on significance values of the 
attributes. There is, however, another aspect to be considered. Let 
the sample be subdivided into a number of age groups. The relative 
sub-sample sizes (r;)m for disease category D; and age group m will in 
general not be constant with respect to m. The significance value of an 
attribute with respect to a category is a function of the sub-sample 
size. If no subdivision of the total sample is considered, the effect of 
the sub-sample size is eliminated because the score of the patient is 
always compared to the score of the average patient and the sub-sample 
size does not have to enter into the significance value. If the same 
comparison is made in case a subdivision exists, the sub-sample size 
will have to be considered in the significance value. The expression 
for (Sjx)m Will be 


VO )eQiac es ee Vi) Dae Py). 
VP. — Pi) VP, 


The mean scores for each age group within the disease category 
were computed to test the possibility that the relative frequency p;x 


(Sidm = (10) 
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of the attributes within the category sub-sample is a function of age. 
No significant interaction was found between the mean scores and age. 

In order to smooth the relative frequencies of the disease categories 
with respect to age, moving averages of three successive decades were 
used. After the >> §?, scores were formed, using formula (8) for 
S;, , the scores were multiplied by (r;)» . 

This modification of the significance value changes the distribu- 
tions of the scores for both the categories and total sample so that 
more success can be expected from the model. Indications of this are 
made in Figure 1, where J is the distribution curve for the restricted 
sample and JJ is the distribution curve for the sub-sample category. 
The dotted lines show the distribution curves using the formula modi- 
fied for age. The trend will be primarily towards a smaller number of 
wrong assignments (false positives). 


NUMBER OF CASES 


SCORE 
FIGURE 1 


DISTRIBUTION OF SCORES FOR CATEGORY AND TOTAL SAMPLE. 
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Because the process of assignment by physicians at The New York 
Hospital for the sample of patients under study cannot have been 
entirely divorced from the attributes, bias in assignment may be ex- 
pected with respect to the attributes. Bias in p,;, arises partly as a 
function of the clinical process used in making a decision. A physician 
in oral interview elicits a patient’s symptoms and then investigates 
with physical and laboratory examinations the diseases he presumes 
to be associated with the symptoms. Some of these symptoms will be 
similar to those indicated by the patient on the CMI. The presump- 
tive diagnoses suggested by the symptoms reported to the physician 
can thus be expected to be related to the diagnostic assignments made 
from the attributes on the CMI. Bias in p;, arising from the clinical 
process is inherent in the data and cannot easily be corrected. 

In Table 6 is given a sample of the relative frequencies for 27 se- 
lected items on the CMI with respect to 6 selected diseases for men in 
the 1948-1949 population. The numbers in the heading of the table 
correspond to the item numbers on the questionnaire. The relative 
frequencies are expressed in percent. In the 7th line of the table the 
corresponding relative frequencies are given for the entire 1948-1949 
sample of male patients. The last six lines of the table give the corre- 
sponding significance indices (S;,). As can be seen, the complaints for 
the 6 selected diseases have very little overlap and form a good basis 
for differentiation of the diseases. The disease categories and items in 
the table were selected to illustrate this characteristic. The table 
shows that differential classifications can be made on the basis of the 
significance indices. 

The sample in Table 7 was chosen from the male 1948-1949 popula- 
tion to present a case where overlap among diseases exists in the com- 
plaints. In the first column of the table the item content is given with 
the item number from the questionnaire. Column headings identify 
the diseases. The last column gives the number of diseases for which 
the particular item is significant and the last line in the tabie shows the 
number of these items significant for the disease. All significance 
indices smaller than 3 are omitted from the table. Even with the 
overlap present, the table still shows that differentiation among diseases 
can be made on the basis of the significance indices. Table 8 gives a 
comparable illustration for the women in the 1948-1949 population. 

Some points in Tables 7 and 8 are of interest: 


1. In Table 7, item 56 of disease 121 (NV = 12) illustrates bias 
introduced because of small sample size in the disease. 
2. In Table 8, item 41 of disease 382 is related to age. 
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TABLE 9 


Scores For 6 PATIENTS. 


Sex— MMMF F F 


Age— 66 35 26 22 32 32 


Number of Complaints— 68 11 85 52 47 27 


Disease Score Disease Score 

1 68 00 00 00 00 00 31 25 00 00 00 00 37 
2 12 00 12 00 00 00 BP 12 00 12 00 12 12 
3 00 00 00 00 00 00 33 75 00 00 00 00 00 
4 50 00 00 00 00 00 34 95 00 00 00 18 00 
5 87 00 00 00 00 00 35 00 00 00 00 00 00 
6 43 00 00 00 00 00 36 00 00 18 00 37 00 
i 00 00 00 00 00 00 ot 75 00 00 00 31 00 
8 00 00 00 438 12 12 38 12 00 18 00 18 00 
9 18 00 00 00 00 00 39 00 00 50 18 12 12 
10 18 00 25 00 00 00 40 12 00 25 00 00 00 
il 18 00 95 00 00 00 41 31 00 00 00 00 00 
Me; 00 00 50 37 25 12 42 18 00 12 00 00 00 
13 00 00 50 00 00 00 43 68 00 00 00 00 00 
14 00 00 00 00 00 00 44 50 00 00 00 00 00 
15 00 00 95 00 00 00 45 75 00 00 00 12 00 
16 00 00 62 00 00 00 46 43 00 12 81 62 00 
17 00 00 00 00 00 00 47 31 00 00 00 00 00 
18 00 00 00 56 68 43 48 00 00 00 00 00 00 
19 37 68 75 00 00 00 49 00 00 00 00 00 00 
20 87 00 00 00 00 00 50 12 00 00 00 00 00 
Pall 93 00 00 43 25 12 51 31 00 00 00 00 00 
22 75 62 62 00 00 00 52 56 00 00 00 00 00 
23 75 00 00 00 00 00 53 81 00 00 00 00 00 
24 25 00 00 00 62 00 54 25 00 37 00 00 00 
7) 00 62 00 00 00 00 55 56 00 00 00 00 00 
26 00 00 00 00 56 25 56 00 00 00 00 00 00 
27 00 00 37 00 00 00 57 00 00 00 75 87 00 
28 12 00 43 00 50 00 58 00 00 31 00 00 00 
29 87 00 12 00 31 95 59 18 00 00 00 00 00 
30 37 00 50 00 00 00 60 00 00 00 18 31 00 


3. In Table 8, item 28 of disease 382 represents the co-existence of 
disease 370. 


In the study of Brodman et al. [1959], the >> (S;,)2 model was used 
to diagnose diseases in the 1956 sample of patients. A constant criti- 
cal score of 35 for all categories was adopted, after examination of the 
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scores obtained for the 1948-1949 sample. The use of critical scores 
determined for each disease independently may improve the model. 

Table 9 gives the scores for 6 randomly selected patients diagnosed 
in the hospital as having rheumatic heart disease. The heading of the 
table gives the sex and age of the patients and the number of “yes” 
responses to the questionnaire. The number of diseases indicated for 
the 6 cases in the table ranges from 21 for the first case to 3 for the 
second and for the last case. 

The model developed with the S;, values derived from the 1948-1949 
sample of patients was used to program a data processing machine 
(van Woerkom [1960]) and was tested with the 1956 sample of patients. 
The machine made correct assignments in 44 percent of cases where 
the 60 diseases were diagnosed in the hospital. In order to test for 
false positives, that is, diseases diagnosed by the machine when the 
disease was not present, a sample of 350 patients was selected for which 
the hospital records could give evidence for the presence or absence of a 
disease. In only 5 percent of cases did the model generate false posi- 
tives. Results with the 350 patients are essentially the same as those 
obtained by a physician experienced in the interpretation of the CMI, 
except that for the diagnosis of psychoneurosis the physician made 
correct assignments more often than did the machine (81 percent as 
compared to 42 percent). 


CONCLUSIONS 


An approach used in efforts such as the one presented in this paper 
is best guided by the characteristics of the data interpreted. It is 
primarily because of the many false negatives in the large amount of 
data analyzed that the present model was developed. When small 
amounts of accurately defined data are analyzed the method may 
require revision. This paper presents a justification of the approach 
reported here, on an analytic basis and on the basis of the results ob- 
tained showing few false positives among many correct assignments. 
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159 NOTE: Three-Quarter Replicates 
of 2* and 2° Designs 


Peter W. M. Joun 


California Research Corporation, Richmond, California and 
University of California, Berkeley, California. U. 8S. A. 


1. THE 24 DESIGN 


The usual half replicate of the 2* design is defined by 1 = 421220304; 
the main effects are clear of two-factor interactions, but the two-factor 
interactions themselves are confounded in pairs. The three-quarter 
replicates given in this section enable both the main effects and two- 
factor interactions to be estimated clear of two-factor interactions. 

The designs are obtained by omitting any one of the following 
quarter replicates, defined by x, = +a, ,2%3 = 42,22. 


Te cnc: bed, abode ref a i Fide 
Wrist) ancd, bd t= et = ie 
Rit 0 abe. ed. 00-2) == es Si 
WVa ab, be, acd, a> o.-—= —2> 3 =] — Fits: 


We consider the case in which I is omitted. 

The three quarters may be combined into three half replicates; IT, ITI, 
Porene een, LV ey re I LV ee aio 

We estimate A from the half replicate III, IV, where 7; = —2,2%2%, , 
so that A is confounded with ABD. In the other half replicates A is 
confounded with the two-factor interactions BC or CD. The other 
effects are obtained, each from a half replicate, in the same way and 
the analysis is presented in tabular form below. It may be shown 
that these are the least squares estimates, when higher-order inter- 
actions are assumed to be negligible. BD, and the mean, may be 
estimated from two half replicates; the least squares estimates are 
then the averages of these two estimates. 


2. THE 2° DESIGN 


The main effects and two-factor interactions in the 2° experiment 
can all be estimated from a half replicate. The variance of each esti- 
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Effect Sets Used Confounding 
A Ill, IV; 24 = —22 ABD 
B Ue VE XL4 = LM1%3 ABCD 
©; III, IV; 71 = —22 BCD 
D II, 111; 73 = 222 ABCD 
AB II, 1V; v4 = 2123 BCD 
AC NOU IAS ee =o ABCD 
AD II, II; 23 = 2x2 BCD 
BC JI, IV; 24 = wiv ABD 
BD sUL- LV Ts, = L173 ABC 

or II, II]; 73 = 2i%2 ACD 
CD II, II; 23 = ize ABD 


mate is 07/4, and all the degrees of freedom are used for effects. We 
present in this section a three-quarter replicate in which all the effects 
are estimable with variance 307/16 and which provides 8 degrees of 
freedom for error. 

The three-quarter replicate using three quarters defined by 
Ys = ts , te = 7X27, is discussed in Davies [1956]. In this 
design all the main effects and two-factor interactions, except H, AD, 
BC, are estimated by averaging the estimates from two half replicates 
with variance 307/16. The latter three effects, however, are estimated 
from only a single half replicate with variance oa’ /4. 

If the quarter omitted is defined by 7 = +2, , %; = 4%2% , We 
have a design in which each main effect and each two-factor interaction 
is estimated from two half replicates. The particular design obtained 
by omitting the quarter 7, = —2% , %; = +x; has the additional 
advantage that it may be developed by augmenting a ‘‘one-at-a-time”’ 
experiment, Daniel [1958]. 


3. SOME ADDITIONAL DESIGNS 


The above designs possess a certain symmetry inasmuch as each 
factor appears at its high level and at its low level the same number 
of times. Another set of designs may be obtained when a half repli- 
cate, in which the highest-order interaction is confounded with the 
mean, is augmented by a quarter which also confounds a main effect. 


For example, one might take three of the sets of treatment combinations 
defined by 


1 = a, = 1xorsvi(es) = 1LiC2Xe0a(as). 


In this design the first factor appears at one level in 2”~’ points 
and at the other level in 2”-” points. 
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It may nevertheless be shown that the method of analysis given 
in the previous sections still gives the least squares estimates. Such 
a design has the advantage in the 2* case over the design given in 
Section 1 that the effect which is estimated with greatest precision is a 
main effect rather than a two-factor interaction. 
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160 NOTE: On the Extension of Stevens’ Tables 
for Asymptotic Regression 


S. Lipton 


University of New South Wales 
Kensington, Sydney, Australia. 


Concerning H. Linhart’s note ({1960]. Biometrics 16, 125), I think 
it is worthwhile pointing out my own extension to Stevens’ tables. This 
work has been reported by H. D. Patterson ([{1956]. Biometrics 12, 
389 and [1958]. Biometrika 45, 323) and was carried out at Rothamsted 
Experimental Station. The extended tables give, in Stevens’ notation, 
the six F’s for r = .10(.01).90 for each integral value of n from 3 to 12 
inclusive. These values do not in the main overlap Linhart’s work. 

The tables proved surprisingly ‘‘popular’’ and I received several 
requests for copies. However, as the entries corresponding to large r 
for small n seem to be somewhat inaccurate, (apparently they are 
correct only to about 4 significant figures compared to more than 6 
figures elsewhere) I am now recomputing the values. I intend to take 
the opportunity of further extending the range of n up to 15. 


161 NOTE: Corrected Error Rates for Duncan’s 
New Multiple Range Test 


H. Leon Harter 


Aeronautical Research Laboratory 
Wright-Patterson Air Force Base, Ohio, U.S. A. 


The error rates published by the author [3] for Duncan’s new 
multiple range test were based upon critical values tabulated by Beyer 
{1] and by Duncan [2], some of which were in error, as reported earlier 
by the author [4]. On the basis of the corrected critical values for this 
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CORRECTED TABLE 1C 
MIntmuM VALUES OF az CORRESPONDING TO as = 0.05, 0.01 


(az, )min | as = 0.05 


mo N #2) .N = 34 N = 2 | Ne N10 N=16|N =25| N-o@ 
2 .0500 .0500 .0500 .0500 .0500 .0500 .0500 .0500 
3 .0496 .0443 .0425 0411 .0402 0396 .0395 .0391 
4 .0463 .0395 .0372 .0354. .0342 .0338 .03834 .0329 
5 .0429 .0357 0834 .0315 .0301 .0298 .0295 .0290 
6 .0400 .0329 .0305 .0287 .0275 .0269 .0266 .0261 
7 .0375 .0305 .0283 .0265 .0254 .0248 .0245 .0239 
8 .0354 .0286 .0265 .0247 .0236 .0231 .0228 .0223 
y .0335 .0270 .0250 .0233 .0222 .0216 .0214 .0209 
10 .0320 .0256 .0237 .0221 .0210 .0206 .0203 .0199 
12 .0294 .0236 .0216 .0202 .0192 .0188 .0185 .0181 
14 0274 .0219 .0202 .0188 ADI AS .0175 .0172 .0168 
16 .0257 .0206 .0189 .0176 .0168 .0164 .0162 .0158 
18 .0244 .0196 .0180 .0167 .0160 .0155 -0153 .0149 
20 0233 .0186 0172 .0160 .0153 .0148 -0146 .0142 


(er )min OG = 0.01 


m |\N=2(\|N=3|N=4)/N=6|N =10 | N = 16 inf = PS || ee 


2 .0100 .0100 .0100 .0100 .0100 .0100 -0100 .0100 
3 .0098 .0085 .0081 .0077 .0075 0074 .0074 .0073 
4 .0089 .0072 .0067 .0063 .0061 .0060 .0059 .0058 
5 .0079 .0063 .0058 .0054 .0052 .0050 .0050 .0049 
6 0071 .0056 .0051 .0048 .0045 0044 .0044 .0043 
fl .0065 .0050 .0046 0043 .0041 .0039 .0039 .0038 
8 .0059 .0045 0041 .0039 .0037 .0036 .0035 .0035 
9 .0054 .0042 .0038 .0036 .0034 0033 .0033 .0032 
10 .0050 .0039 .0035 .0033 .0031 .0030 .0030 .0029 
12 .0044 .0034 .0031 .0029 .0028 .0027 .0027 .0026 
14 0039 .0031 .0028 -0026 .0025 .0024 .0024 .0023 


16 .0036 .0028 .0025 .0024 .0023 .0022 .0022 .0021 
18 .0033 .0026 .0024 .0022 .0021 .0020 .0020 .0020 
20 .0030 .0024 .0022 .0020 -0019 .0019 .0019 .0018 


QUERIES AND NOTES 323 


CORRECTIONS TO TABLE 2 
CoMPARISON oF Type II-III Error Rates For SINGLE CLASSIFICATION 


A. Values of 82 = (8§)max for a = 0.05 | B. Values of 8 = (84) max for a =-0.01 


N m | &=1.5 8’=2.0 8’=2.5 8’=3.0| N m | 8'=2.0 0’ =2.5 8’=3.0 8'=3.5 
5 3] .464 .198 .0602 Bers 1508.8 229) 07265 01S 
4| .467  .197 .0577* .0129 4) 487 211 .0630 .0143 
5| .472 .200 .0572 .0120 5| .479 204 .0588 .0124 
6] .478 .203 .0574 6| .476 .201 .0568 .0114 
12] .508 .0626 12 .208  .0567 .0101 
20 | .532 0685 .0123 16 | .498 .215 .0589 .0103 
4 2286 | 0723" 0111 8 =1.5 8'=2.0 5 =2.5 8’=3.0 
=1.0 7 =1.5 8 =2.0 &=2.5| 7 3] .577 234 .0556 .0088 
9 4] .535 .169 .0251 9 3] .397 .0989 .0126 
8| .577 5] .411  .103 .0119 .00075 
8 =0.5 &=1.0 &#=1.5 5 =2.0 8 =1.0 8 =1.5 3’=2.0  =2.5 
13° 3 .0496 tL <8 302 = .0470 
4 .0537 12: ))) 762, 324 - 0528 
5 372 0572 16 | .776 .841 .0577 .0033 
6 384  .0604 
8 403.0659 13 8] .667 .201 .0185 .00051 
16 4 775 .257 + .0214 .00046 8’ =0.5 8/=1.0 8’=1.5 5’=2.0 
25 3 .0774 16 5 0897 .0035 
4 .0858 6 | .932 .536 .0946 
5 .0927 12] .947 .585 .116 
6 .0986 
2116] .929 .440 .0388 
S14) © 573° <.0385 
5 .0422 AP RANTS) 6.0372 
5 .0466 
a 8 | .750 .0565 
*For this case 63 = (Bx)min = .0818 12 .0668 
16 .0743 


61 10 -088 — .0057 
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test recently published by the author [5], Table 1C of [3], which gives 
minimum values of the Type I error rate ox for the LSD test corre- 
sponding to as = 0.05, 0.01 for Duncan’s test, has been completely 
recomputed, and the corrected table is given below. Also given is a 
list of corrections to Table 2 of [3], which gives values of 6; = (62 axe; 
the maximum combined Type I-III error rate for Duncan’s test, for 
a = 0.05, 0.01. To reduce the length of the list, corrections of a single 
unit in the last place have been omitted, except when there are larger 
corrections for other value (s) of 6’ for the same values of N and m. 
[In both tables, NV represents the size of each sample, and m represents 
the number of sample means being compared. In Table 2, 6’ represents 
the ratio of the difference between the largest and smallest sample 
means in the group being tested to the pooled estimate s of the popula- 
tion standard deviation o]. Table 3 of [3], which gives maximum re- 
quired sample sizes (NV s)max for Duncan’s test for prescribed values of 
a, B, and m, has not been recomputed on the basis of the corrected 
critical values. If this were done, there might be an occasional change 
of 1, but most values are correct as they stand. 
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162 QUERY: Multiple Comparisons between 
Treatments and a Control 


For multiple comparisons between p treatments and one control 
Dunnett [1955] has tabulated values of ¢ from which confidence limits 
can be calculated. His tables provide for values of p from 1 to 9, and 
for degrees of freedom from 5 to infinity. The full tables allow for 
different numbers of tests (n) for each treatment, but when the number 
is the same for each treatment the number of degrees of freedom is 
given by (p + 1)(n — 1), so that a simpler table can be derived for p 
against n, including the following:- 
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Does it really make sense for an increase in the number of treatments 
in each test to lead to smaller significant differences when there are 
only two tests of each, but larger when there are four or more? Or, 
when there are three tests of each, for an increase in treatments to 
increase the size of differences which are significant at P = 0.05 but 
reduce those at P = 0.01? 


REFERENCE 


Dunnett, C. W., [1955]. A multiple comparison procedure for comparing several 
treatments with a control. J. Amer. Stat. Assoc. 50, 1096. 


ANSWER: 


At first sight the behaviour of the table does appear rather odd. 
Intuitively one would probably expect an increase in the number of 
treatments to lead to a larger significant difference in all circumstances. 
This would certainly be the case if the only error variation were that 
associated with the control and treatment means. But it must be 
remembered that the variance has to be estimated, and the value of t 
has to allow for error in its value too. The reason the value of ¢ de- 
creases with p for the small values of n is that there the number of 
degrees of freedom for estimating the variance is small and, in effect, 
we gain more from the increase in degrees of freedom than we lose as a 
result of the greater number of treatment means. 

If the table were extended to larger values of p, the value of ¢ for 
n = 2, and for n = 3 when P = .01, would pass through a minimum 
and then increase with the value of p, indicating that a point is reached 
at which any further increase in the number of degrees of freedom is 
not sufficient to compensate for the increase in the number of treat- 
ments. For values of n > 4(P = .01) and n > 3(P = .05), this is the 
situation which pertains for all values of p. 
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It may be verified that the same phenomenon occurs in the case of 
the Studentized range, if the percentage points of that distribution 
are tabulated with n as parameter instead of the number of degrees 
of freedom. 


C. W. DUNNETT 

Lederle Laboratories Diwision 
American Cyanamid Company 
Pearl River, New York 


163 QUERY: Error Rates in Multiple Comparisons 


Many biologists, realising the necessity for statistical examination 
of data, are unfortunately without access to professional advice, and 
must perforce search the literature themselves. One such worker 
encountered Steel’s description [1959] of a multiple comparison sign 
test, which seemed to fill an urgent need, yet on closer study to reveal 
an apparent paradox. As his example Steel applies his test to data 
from 13 soybean yield trials of 3 varieties, and concludes that one 
variety (Z) was significantly better than control (C) because it was 
inferior (—) to control in only 2 of the trials. If, however, there had 
been five varieties in each trial, Z would not have been significantly 
different from control, because with 5 treatments the number of minus 
signs must not exceed one for significance. And yet the inclusion of 
two more varieties in the trials could not have made one iota of differ- 
ence to the yield figures for Z and C from which alone the number of 
inferiorities was obtained. How can the presence or absence of data 
about W, Y, etc., entirely irrelevant to the Z : C comparison and to all 
the computations involved alter the probability that Z is the same as C? 
Or are there implied assumptions, not apparent to the amateur reading 
Steel’s paper, about the behaviour of W and Y relative to Z and C 
which determine when this test should be used in preference to Dixon 
and Mood’s [1946] simple sign test? 


REFERENCES 
Dixon, W. J., and Mood, A. M., [1946]. The statistical sign test. J. Amer. Stat. 
Assoc. 41, 557. 


Steel, R. G. D., [1959]. A multiple comparison sign test: treatment versus control. 
J. Amer. Stat Assoc. 54, 767. 
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ANSWER: 


This query seems to arise from a failure to recognize that several 
definitions of error rate exist. These, of which only two are considered 
here, were introduced in connection with multiple comparison pro- 
cedures and are just now finding their way into application and text- 
book. The decision as to which error rate is more appropriate and 
what significance level is to be used must, finally, be made by the 
experimenter. 

Error rates are defined on the basis that the null hypothesis is true. 

Per-comparison error rate = 

Number of comparisons falsely declared significant 


Total number of comparisons 
Here, the comparison is the conceptual unit. When comparisons 
are chosen without reference to the experimental results, a test such 
as the (parametric) /sd provides a per-comparison error rate at the 
tabulated value regardless of whether or not the comparisons are inde- 
pendent. A criticism of tests using this error rate is that, if many 
comparisons are made, the probability of finding at least one falsely 
significant difference is high. 
Experimentwise error rate = 
Number of experiments with at least one difference 
falsely declared significant 


Total number of experiments 


Here, the experiment is the conceptual unit. For pairwise compari- 
sons, we agree to test the largest difference when it occurs. If this 
difference is significant, then this experiment is one for which at least 
one difference has been declared significant. Tukey’s (parametric) 
test based on the studentized range has such an error rate; Dunnett’s 
(parametric) test of treatments against control also has this error rate. 

Clearly, tests with an experimentwise error rate will require a 
larger difference for significance than that required by the /sd. Further- 
more, the required difference will increase with the number of treat- 
ments. This becomes a basis of criticism: the probability of detecting 
a real difference of fixed size is lower when an experimentwise error 
rate is used and decreases with the number of treatments. 

Error rates are not, perhaps, as different as they first seem. If 
Q@¢ and ag represent per-comparison and experimentwise error rates 
respectively, then for k independent comparisons (1 — az) = (1 — ac)". 
For dependent comparisons, the relation is not as obvious. Harter 
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[1957] has tabulated comparisons of significance levels for a number of 
multiple comparisons test procedures intended to compare all pairs 
of means. 

It is apparent, then, that if an investigator chooses a definition of 
error rate, a test, and a significance level, then this information permits 
him to compute the significance level for any other suitable test and 
error rate. This knowledge eliminates any paradox. 

No statistical assumptions determine which test is appropriate, 
regardless of whether the test is parametric or non-parametric. One 
chooses an error rate and significance level, prior to the conduct of the 
experiment, to give the kind of protection desired. It seems to me 
that the basic problem concerns a power function, presumably related 
to “urgent need” and, in turn, the sample size necessary to accomplish 
clearly stated aims. 


REFERENCE 
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R. G. D. STEEL 

Institute of Statistics 

North Carolina State College 
Raleigh, N.C., U. S. A. 
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FALCONER, D. 8. Introduction to Quantitative Genetics. Hdinburgh and 


10 : 
London: Oliver & Boyd Ltd., 1960. Pp. ix + 365. Figures and Tables. 35s. 


W. F. Bopmsr, Department of Genetics, University of Cambridge, Cambridge, England. 


There has long been a need for a good introductory textbook on quantitative 
genetics, and Dr. Falconer’s new book satisfies this need more than adequately. 
The term quantitative genetics is here used in its broadest sense to include both the 
study of genetic changes in populations and the study of the inheritance of continuous 
variation. The genetic analysis of changes in quantitative characters in both natural 
and artificial populations requires a close interplay between these two aspects. 

The first five chapters of the book are concerned with the analysis of changes 

in gene frequency in natural and artificial populations. These provide perhaps 
the best elementary exposition I know of what is usually called population genetics. 
Dr. Falconer has a great gift for lucid and stimulating exposition. He presents 
the essential mathematics in a way that should be intelligible event to the least 
mathematically minded biologist. The numerical examples are excellently chosen 
to help in the understanding of the mathematical formulae as are also the graphical 
illustrations. In my opinion Dr. Falconer rightly assumes a knowledge of elementary 
genetics and later also of the elements of statistics. His book would have been 
unnecessarily burdened by any attempt to make it complete in these respects. 
' The genetics of continuous variation is treated in the following fifteen chapters 
with the same high standard of exposition. This subject has been approached in a 
variety of ways by different research schools and suffers from a confusion of termi- 
nology and notation. Dr. Falconer sets himself the difficult task of integrating the 
subject at an elementary level and is on the whole most successful. Itis not surprising 
if the subject matter is somewhat biased toward the approach of the Edinburgh 
‘school’, of which he is a member. Dr. Falconer is at least honest in delineating 
controversial topics and making it quite clear when a discussion is more a matter 
of personal opinion and preference than of established fact. 

Any criticism of the subject matter is perhaps more a criticism of the develop- 
ment of the subject than of Dr. Falconer’s account of it. However there is a notable 
lack of any emphasis on the importance of linkage, and the treatment of Mather’s 
concept of balance is rather brief. The importance of epistacy is also, in my opinion, 
underestimated. A good epistatic genotype may be quite rare, and so contribute 
only little to an interaction variance, and yet be of considerable importance to the 

‘ practical breeder. As Dr. Falconer says in his final discussion, there is need for much 
more experimental work. This should not, I feel, be directed at estimating more 
and more heritabilities and variance components for a wide selection of characters 
and organisms, as he suggests, but aimed at a more penetrating and specific analysis 
of the fundamental mechanisms underlying the inheritance of quantitative characters. 
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In conclusion, this is a book that should appeal to a wide audience. For the 
experienced geneticist it provides a stimulating and refreshing review of the subject, 
and for the newcomer to genetics it offers an excellent introduction. There is no 
doubt that it will be the best standard textbook of its kind for some years to come. 


GOLDBERG, 8S. Probability: An Introduction. Englewood Cliffs, N. J.: 
Prentice Hall, Inc. 1960. Pp. xiv + 322. 30 Figures, 44 Tables. $5.95. 


F. N. Davin, Department of Statistics, University College, London, England. 


This book, written by a pupil of W. Feller, has been tried out, we are informed, 
on the freshman mathematics class of Oberlin College. This should give some idea 
of the level of mathematical sophistication which is aimed at and which the author 
generally achieves. 

The first chapter on the manipulations and the algebra of sets is necessary for 
the main development of the book. It will be found difficult for persons wishing 
to learn probability with a view to practical applications, but the would-be mathe- 
matician will find it interesting, for the elements of mathematical logic are introduced. 
Probability in finite sample spaces follows. The author does not really attempt 
to justify the calculation of a probability, being concerned simply to define a one 
to one correspondence between the field of events and the probability set, and then 
to discuss the sub-sets involved. He says: ‘The theory of probability begins when 
a sample space, S, the mathematical counterpart of an experiment, is specified’’— 
an entirely proper point of view for a mathematican to take. The second chapter, 
which covers conditional and compound probabilities, Bayes’ formula and independ- 
ence, will be found rather stiff reading to the uninitiated. 

After a short discursus into elementary enumeration through the binomial and 
multinomial coefficients, with the arithmetic triangle and several combinatorial 
theories expounded, the remainder of the book is concerned with the properties 
of random variables. These are few brief notes towards the end on testing a statisti- 
cal hypothesis and on decision-making. 

For those persons who already know some probability theory this book will be 
enjoyable reading, if only to appreciate how familiar theorems can be wrapped up 
in mathematical language. For mathematicians in embryo who want to learn how 
a mathematician looks at the elements of probability theory this book will serve a 
useful educational purpose. For the statistician in embryo this book is not suitable 
since enlightenment can be achieved more simply elsewhere. (The reviewer has 
in mind, for example, An Introduction to Probability Theory and its Applications 
by W. Feller). But possibly the author did not intend to cater for this last class 
anyway. 


ABSTRACTS 


The following are abstracts of papers presented at meetings of the British Region 
held on October 27 and December 18, 1960. 


W. T. WILLIAMS (Botany Department, University of Southampton). 


745 ae ; : 
Some Applications of Electronic Computers in Plant Ecology and Taxonomy. 


In ecological survey the population under study consists of sample areas 
(quadrats) defined by the presence or absence of a number of plant species, whose 
names and ecological requirements may not be known. The problem is to find the 
major discontinuities in the population. This can, in theory, be solved by succes- 
sively subdividing the population so as to reduce the associations between the species 
by the greatest possible extent at each stage; this in turn involves setting up the 
correlation matrix at each stage and finding the species with the highest loading 
on the first principal axis of this matrix. To hold the entire matrix in a computer 
reduces unacceptably the space available for data. Instead of r, therefore, |r| is 
calculated, and the |r| values for each species summed as they are obtained. The 
species with the highest sum of |r| is necessarily that with the highest loading on 
the first centroid axis of the matrix (|r|); this has been found to be an acceptable 
approximation to the answer required. Examples will be given of the application 
of this method to actual areas. 

Valuable information on the structure of plant communities can be obtained 
by carrying out the corresponding inverse (Q-technique) subdivision; and, by 
collating the direct and inverse analyses, a completely objective account of the 
vegetation of a complex area can be given in extremely economical terms. 

In taxonomy it is sometimes desirable to carry out a conventional factor anlysis, 
with communalities and rotation. No programmes of this type seem to be available 
in Britain; but the standard programmes for the extraction of latent roots and 
vectors greatly reduce computational labour. Moreover, the existence of general 
“matrix schemes” makes calculation of specification equations a relatively simple 
task. 


J. N. R. JEFFERS (Forestry Commission, Wrecclesham, Farnham, Surrey). 


i Data Processing in Biological Research. 


The first task in the application of electronic digital computers to the statistical 
analysis of data arising in biological research has been the writing of the programmes 
for the actual analysis. This is, however, only a small part of the whole process, 
and the paper presents the results of recent work on the recording, compilation, 
- and storage of data on paper tape and magnetic tape. This work suggests the need 
for a complete revision of the methods used in handling biological data, and has 
shown very large savings in the cost of research and in the unproductive use of key 
research workers. Also considered is the problem of the output results from the 
computer in a directly intelligible form, though progress has been slower in this field. 


331 


332 BIOMETRICS, JUNE 1961 


F. YATES (Rothamsted Experimental Station, Harpenden, Herts.). Problems 


th Arising in the Use of Electronic Computers in Statistical Analysis. 


A small electronic computer has been in use in the Rothamsted Statistics 
Department for the last six years. It has been used for a wide variety of statistical 
problems arising in agricultural and biological research. It was early applied to the 
analysis of replicated experiments and a large number of such analyses are now 
performed every year. More recently its use in the analysis of surveys has been 
developed. A general programme which has been written for survey analysis will 
be outlined and used to illustrate the types of problem that are encountered in using 
electronic computers for statistical work. 


C. W. EMMENS (University of Sydney, N. S. W., Australia and Royal 
748 Veterinary College, London). The Planning and Analysis of Some Field 
Trials with Cattle. 


Three field trials of the fertility of deep frozen semen have recently been 
conducted in Sydney. They were as far as possible completely balanced, on a 
factorial scheme, and have given remarkably consistent results, with residual 
x2-values within expected limits. This is attributed to the care with which randomisa- 
tion and technique were controlled under difficult conditions. 


C. C. SPICER (Imperial Cancer Res. Fund, Lincoln’s Inn Fields, London). 


hoe Problems in the Analysis of a Large Scale Clinical Trial. 


The purpose of this trial was the evaluation of chemotherapy in the treatment 
and prevention of chronic bronchitis. It presents a number of points of interest 
both in planning and analysis. 


The following are abstracts of papers presented at a special meeting on ““BIO- 
METRICAL ASPECTS OF PLANT GROWTH” held in London on January 2, 
1961, by the British Region in conjunction with the Society for Experimental Biology. 


J. A. NELDER (National Vegetable Research Station, Wellesbourne, War- 


ee wick). Models and Experiments for Growth Analysis. 


The relationship of growth analysis to other forms of statistical analysis, partic- 
ularly those of long-term experiments and multivariate analysis, is indicated. The 
necessity for the development of methods for analysing the inter-related growths 
of different parts of a plant is stressed, and a first effort model of differential equa- 
tions analagous to linear regression is proposed. Types of deviation from the assumed 
model, both systematic and random, and the kind of experiment needed to detect 
them are discussed together with problems arising in the estimation of parameters 
in growth equations. Possible applications of the model are indicated. 
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8. C. PEARCE and C. 8. MOORE (East Malling Research Station, Nr. 
751 Maidstone, Kent). A Study of the Sources of Variation in Growth of Fruit 
Trees. 


A method of studying growth previously described (Biometrics 16, 1-6) is applied 
to annual records of apple tree trunk circumference. The purpose of the investiga- 
tion is to determine the main causes of variation in field trials with clonal fruit trees. 
Three standard errors are plotted against the logarithmic development of girth, 
namely, (a) log (girth), (b) increment in log (girth) since planting, and (c) log (girth) 
adjusted by girth at planting. A study of these curves enables the total variation 
resulting after a period of growth to be divided between initial variation, and initial 
growth rate negatively correlated with initial size differences, and a subsequent 
growth rate which is uncorrelated with initial size. Comparison of groups of trees 
having different treatments demonstrates the importance of good growing condi- 
tions in minimising variation. The reduction in positional variation by removing 
block effects and the results of cropping are also considered. The effects of these 
modifications in conditions accord with the theoretical model. Some measures are 
suggested for minimising error in field trials. 


M. J. R. Healy (Rothamsted Experimental Station, Harpenden, Herts.). 


gee Experiments for Comparing Growth Curves. 


Consider an experiment designed to study the differential effects of treatments 
on the growth curve of some organism. Suppose that the experimental subjects 
are allocated to the treatments by some process of randomisation and that observa- 
tions are taken when the treatments are first applied and at regular intervals there- 
after. Wishart suggested that tests of significance for the existence of treatment 
effects could be made by fitting a polynomial to the data from each subject and by 
applying standard statistical techniques to the coefficients of the polynomials. The 
paper discusses Wishart’s technique and extends it to provide estimates of the 
treatment effects. 

A general discussion was led by F. L. Milthorpe (University of Nottingham. ) 


CORRECTION 


E. N. Hey and M. H. Hey [1960]. The Statistical Estimation of a Rectangular 
Hyperbola. Biometrics 16, 606-17. 


The expression, nine lines from the bottom of page 609, should coincide with 
the column vector on the right of the matrix equation at the top of page 610. The 
solution for a, b, and c — ab on page 610 should have coefficients bracketed to agree 
with the matrix equation at the top of the page. In footnote 2, page 615, Ey in 
the matrix should be HE, and on page 617, line 6, x should be replaced by 2. 
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International 
ANNUAL FINANCIAL STATEMENTS 
Tuer Biometric Society 1960 
BALANCE SHEET 
Assets 
Cash: Bank Balance $9 ,605.90 
Petty Cash 19.40 $9 , 625.30 
Inabilities 
Subscriptions, 1961 $ 107.50 
Dues, 1961 54.50 $ 162.00 
Surplus, Jan. 1, 1960 (including petty cash) 7,048.64 
Gain for Period 2,414.66 9,463 .30 
$9 , 625.30 
Audited: B. G. Selvidge (Signed) 
Date: March 20, 1961 
INCOME AND EXPENDITURE STATEMENT 
Income 
Subscriptions, 1959 $ 358.75 
Subscriptions, 1960 6,225.00 $ 6,583.75 
Dues, 1959 105, 50 
Dues, 1960 2,466.00 2,571.50 
Sustaining memberships, 1960 650 .00 
Back dues and subscriptions 32.00 
Regional allotments 116.50 
BIOMETRICS allotment from sustaining members 250.00 
Back issues 93 . 50 
Member subscriptions to Journal of ASA 35.00 
Sale of directories 2.00 
Overpayments 20.50 
Printing—Harvard University Press 30.00 
Advance to Pharmacology Symposium 1,000.00 
Bank refund of service charge 1.00 
1,548.50 
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Less: Credits and regional allotments used 138.00 1,410.50 
Total Income $11,247.75 


Expenditures 


BIOMETRICS $6 , 996 . 28 

Postage 250.23 

Office supplies 2.89 

Printing 260.29 

President’s Office 200.00 

Member subscriptions to Journal of ASA 35.00 

Addressing and mailing services 88.40 

Pharmacology Symposium 1,000.00 $ 8,833.09 
Excess of Income over Expenditures 2,414.66 


$11,247.75 


Audited: B. G. Selvidge (Signed) 
Date: March 20, 1961 


SECRETARY’S ACCOUNT 
STATEMENT OF INCOME AND EXPENDITURE 
For the Period ist January, 1959 to 31st December, 1960 


Income 

Balance in hand, 31st December, 1958 £ 64.10. 4d 
Received from Treasurer 266.11. 3d 
Menbership dues received 1.12. =d 
£332.13. 7d 

Expenditures 
Office equipment and stationary £56. 2. 6d 
Secretarial assistance 150. —. -d 
Printing and binding 13.10) vd 
Postage 49.12. 1d 
Travel 29.15. 9d 
Books 18. 8. —-d 
Membership dues paid over 1.13. 6d 
£319. 2. 5d 
Balance, 31st December, 1960 e@ Maya itil PX0l 


I certify the above to be a true record of my transactions on behalf of the 
Biometric Society 


M. J. R. Healy 
Secretary 


I have examined the account book and vouchers produced by the Secretary 
and certify that the above statement is in accordance therewith. 


BE. Church, A.A.C.C.A. 
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STATEMENT OF OPERATIONS 


For the Year Ending January 31, 1961 
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Income 
Biometric Society: 
907 Subscriptions at 
983 Subscriptions at 


9 Sustaining Members at 


1161 Direct Subscriptions at 
Sale of Back Issues: 

Biometric Society 

Editor’s Office 


March 1960 Issues at Cost to 
Biometric Society 

Sale of Reprints 

Over Payments 

Payments to Regions 


Total Income from Operations 


Expenditures 


Cost of Printing Journals 
Issue No. 1 
Issue No. 2 
Issue No. 3 
Issue No. 4 


Mailing and Express Charges 
Issue No. 1 
Issue No. 2 
Issue No. 3 
Issue No. 4 


Cost of Printing Reprints (See Comments) 


Issue No. 4 (1959) 
Issue No. 1 
Issue No. 2 
Issue No. 3 


Mailing Charges Reprints (See Comments) 


Issue No. 4 (1959) 
Issue No. 1 
Issue No. 2 
Issue No. 3 


$ 4.00 
2.75 
25.00 


7.00 


$2,928.85 
PeEy ya 
2,988.44 
4,467.72 


$ 206.28 
229 .82 
310.70 
346 .56 


$ 290.69 
329 .43 
331.79 
370.75 


$ 24.01 
27.99 
22.75 
38 .94 


1,093 .36 


$ 3,628.00 
2,703.25 
225 .00 


$ 374.00 


4,474.00 


$13 , 729 .72 


$ 1,322.66 


113.69 


$ 6,556.25 


8,127.00 


4,848.00 


61.28 
1,537.75 
11.50 
6.50 


$21, 148.28 


$14 , 823 .08 


$ 1,486.35 
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Operating Expenses 


Stamps $ 297.28 
Office Supplies 523.68 
Mailing Lists and Addressographing USS 6745) 
Back Issue Moving Costs 182.94 
Back Issue Handling and Storage 329.76 
Back Issue Insurance 46.35 
Auditing 45.00 
Bank Exchange and Service Charges 2.43 
Refunds and Overpayments on Subscriptions 55.25 
Salaries and F.I.C.A. Taxes 525.02 
Transfers to Regions 6.50 
Express Charges 13.97 $ 2,159.93 
Total Expenditures from Operations 18,419.36 
Surplus from Operations $ 2,728.92 
Non-Operating Items 
Income 
Bank Interest $ 680.75 
Refund on Insurance Policy 31.66 
Bank Credit and Exchange 2.06 
Announcement 57 .00 
Refund on Blacksburg Postage Deposit 66.61 
Correction of Surplus from Previous 
Volumes (See Comments) 28.35 $ 866.43 
Expenditures 
Credits of 1959 Accounts Receivable 33.50 
Surplus from Non-Operating Items 832.93 
Gross Surplus, 1-31-61 $ 3,561.85 
BIOMETRICS, VOLUME 16 
BALANCE SHEET 
January 31, 1961 
Assets 
Accounts Receivable $ 2,089.25 
Bank Balances 
Savings Account #1 (Blacksburg) 10,397.02 
Savings Account #2 (Blacksburg) 9,637.50 
Savings Account #3 (Blacksburg) 3,168.41 
Checking Account (Tallahassee) 10,875.78 
Prepaid Insurance (Expires 4-25-63) 139.05 


Total Assets $36 , 307.01 
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Liabilities and Surplus 


Subscriptions to Volume #17 $ 5,993.75 
Subscriptions to Volume #18 182.00 
Subscriptions to Volume #19 42.00 
Subscriptions to Volume #20 7.00 
Surplus from Previous Volumes 26,520.41 
Surplus from Volume #16 3,561.85 
Total Liabilities and Surplus $36 , 307 .O1 
Comments 


(1) Not included in Assets is stock of back issues from Volumes 1-16. 

(2) Not included in Expenses is printing bill for December reprints of $507.20 

(3) Corrections of surplus from previous volumes results from a reconciliation of 
subscription card files and actual liabilities due to prepaid subscriptions. 


Audited February 17, 1961 
Richard Q. Conrad (Signed ) 


Region Belge 
L’Assemblée Générale annuelle de la Société Adolphe Quetelet s’est tenue a 
Bruxelles, dans la salle du Conseil de la Bibliothéque Royale de Belgique, le 25 
février 1961. Le Conseil d’ Administration pour 1961 et 1962 a été constitue comme 
suit: 
Président: Professeur Maurice WELSCH. 
Vice-Présidents: Prof. R. CONSAEL, H. LAUDELOUT, A. LECRENIER, 
J. REUSE et A. VAN DEN HENDE. 
Secrétaire: Dr. L. MARTIN. 
Secrétaire-adjointe: Anne LENGER. 
Trésorier: Pierre GILBERT. 
Membres: Dr. H. G. LION, 
Mme OSLET-CONTER, 
Mr. H. ROGGEN. 


L’ Assemblée Générale a été suivie d’une réunion organisée conjointement par 
la Société Quetelet, le Comité belge Whistoire des Sciences et la Société belge de 
Statistiqué Les exposés suivants ont été donnés: 


“Les rapports historiques entre Vanthropologie et la statistique’ par Monsieur 
A. LEGUEBE; 

“La, distribution des caractéres anthropométriques”’ par le Professeur C. GINI, 
Président de l’Institut International de Sociologie 4 Rome. 


British Region 
At a meeting held on February 28th, 1961, the following papers were read and 
discussed: 
P. D. Oldham—The Distribution of Arterial Pressure in the General Population. 
L. R. Taylor—A Power Law Transformation for Aggregated Populations. 
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WENCAR: 
TREASURER’S REPORT—DECEMBER 31, 1960 
Bernice Brown, Treasurer WNAR 
Receipts 
Balance forwarded from previous treasurer, Dec., 1959 $ 461.22 
Received from 1960 dues (143 @ 7, 10 @ 4) 
1961 dues ( 61 @7, 1 @4) 1472.00 
Received for orders of back issues of BIOMETRICS 16.00 
Received from Biometric Society (credit for one institutional 
membership ) 10.00 
Received from Biometric Society Treas. (credit for 4 transfers 
from ENAR) 4.00 
Received from ENAR for information circulars 41.60 
Total $2004.82 
Expenses 
To Treasurer International (143 @ 6, 10 @ 4) 898 .00 
To BIOMETRICS for back issues ordered by members 16.00 
To Marion Sandomire—mailing expense 11.15 
To printers for stationary and supplies 111.84 
To C. Zippin—partial expense to meeting 26.18 
To W. Becker—mailing expense 12.00 
To ENAR Treasurer—mailing expense 5.00 
To printers for notices of meetings 45.40 
To Pullman Herald for stationary 92.08 
Total $1217 .65 
On hand—checkbook balance $ 787.17 


Total $2004 . 82 


NEWS AND ANNOUNCEMENTS 


Members are invited to transmit to their National or Regional Secretary (af mem- 
bers at large, to the General Secretary) news of appointments, distinctions, or retirements, 
and announcements of professtonal interest. 


Robert W. Allard has been granted a Guggenheim Fellowship to study stochastic 
processes in genetics during a six months sabbatical leave to be spent at Oxford 
University in the Biometric Unit beginning February 1, 1961. 

Neeti R. Bohidar has accepted the position of Assistant Professor of Statistics 
in the Department of Applied Statistics, Statistical Laboratory, Utah State Uni- 
versity, Logan, Utah. He received his Ph.D. degree in Statistics from Iowa State 
University in 1960. 

Joseph G. Bryan, formerly Senior Operations Research Analyst at the Central 
Research Laboratory, American Machine and Foundry Co., recently changed 
positions and is now employed by Travelers Weather Research Center, Inc., 11 
Newport Avenue, West Hartford 7, Connecticut. 

William R. Buckland has resigned from London Transport Executive to take 
charge of the new Statistical Advisory Service of The Economist Intelligence Unit 
Ltd., 22, Ryder Street, London, 8.W.1. 

Lyle D. Calvin, Oregon State University, spent the 1960 fall term on sabbatical 
leave with the Department of Statistics, Harvard University. 

Douglas G. Chapman participated in the Fourth Annual Meeting of the North 
Pacific Fur Seal Commission held in Tokyo, January 20 to February 4 as a Scientific 
Advisor to the U. S. Commissioner. The North Pacific Fur Seal Commission is 
composed of representatives from Canada, Japan, U.S.S.R., and the U.S.A. 

Franklin A. Graybill recently left Oklahoma State University to become Profes- 
sor of Mathematical Statistics and Director of the Statistical Laboratory at Colorado 
State University, Fort Collins, Colorado. 

Iowa State University granted Doctor of Philosophy degrees, with major in 
Statistics, to the following people during the Winter Quarter graduation: Roger 
McCullough, Jose Nieto de Pascual, J. N. K. Rao and Thomas Neil Throckmorton. 

Gwilyn Jenkins, after spending a year at Stanford, is now back at his previous 
post as lecturer in Mathematical Statistics, University of London. 

Kuo Hwa Lu, formerly Associate Professor, Department of Statistics, Utah 
State University, has taken the position of Associate Professor of Biostatistics at 
the University of Oregon Dental School, Portland. 

Lincoln E. Moses, Professor of Statistics at Stanford University, is spending a 
year on sabbatical leave at the Department of Social Medicine, Oxford University, 
England. Dr. Moses received a Guggenheim Fellowship. 

Donald B. Siniff had taken a position as Biometrician in the Division of Bio- 
logical Research, Alaska Department of Fish and Game. He previously was at 
North Carolina State College working under the Southeastern Cooperative Wildlife 
Statistics Project. 
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TRAVEL GRANTS FOR ATTENDANCE AT THE INTERNATIONAL 
CONGRESS OF MATHEMATICIANS 


Travel grants will be made to a number of mathematicians who wish to attend 
the International Congress of Mathematicians in Stockholm, on August 15-22, 1962. 
It is hoped that funds available through various sources may provide travel assist- 
ance for a considerable number of mathematicians. 

There will be a greater effort than in the past to give aid to younger people. 
As grants will be made only to those who have filed applications, it is urgent that 
any who wish to receive a grant should fill out and file an application. Younger 
people are urged to file applications so that their cases can be considered. Applica- 
tions can be obtained from the Division of Mathematics, National Academy of 
Sciences, National Research Council, Washington 25, D. C. by requesting an applica- 
tion for a travel grant to the 1962 International Congress. 

The deadline for filing of applications is November 1, 1961, and an attempt 
will be made to announce the grants by January 1, 1962. 

Awarding of grants will be made only to those persons whose applications have 
been received, in good order, by November 1. The selection will be made by a 
committee consisting of the regular Committee on Travel Grants of the Division 
of Mathematics of the National Academy of Sciences—National Research Council 
enlarged to include representatives of societies affiliated with the Division and 
representatives of various governmental agencies. 


TRAINEESHIPS FOR PUBLIC HEALTH STATISTICIANS 


The Public Health Service has announced the availability of traineeships for 
graduate training of professional public health personnel during the 1961-1962 
academic year. 

Traineeships in public health statistics are available to qualified persons. They 
provide stipends from a minimum of $250 per month for a post-bachelor candidate 
to a maximum of $400 per month for a post doctoral candidate and additional 
allowances for dependents, travel of the trainee, and academic tuition and fees. 

Additional information and application forms may be secured from the Division 
of General Health Services, Public Health Service, U. S. Department of Health, 
Education, and Welfare, Washington 25, D. C. 


GRADUATE TRAINEESHIPS IN BIOMETRY 


Training programs designed to prepare students in the application of statistical 
and mathematical methods to biological problems, particularly those related to 
health and medical sciences, now exist in more than 20 universities throughout the 
country. Supported by training grant funds from the Public Health Service, NIH, 
these programs provide unusual opportunities for careers in teaching, research, and 
consultation. Employment opportunities for biometricians are excellent, with the 
demand by governmental and voluntary health agencies, medical research and 
educational institutions, and industry running far in excess of the available supply 
of trained personnel. 

Programs of study are individually designed to lead to doctoral degrees, and in 
special instances, to other academic degrees. Traineeship stipends are provided at 
various levels depending on previous education and experience of the trainee and 
include allowances for dependents. Substantially full economic support or partial 
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support may be provided, depending upon the proportion of time spent in training. 
Interested applicants are encouraged to correspond with one of more of the 
Program Directors listed below because course offerings, as well as specific research 
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problems for application of learned skills, vary from school to school. 


Dr. Virgil Anderson 
Purdue University 
Lafayette, Indiana 


Dr. George F. Badger 
Western Reserve University 
Cleveland 6, Ohio 


Dr. Jacob E. Bearman 
University of Minnesota 
Minneapolis 14, Minnesota 


Dr. Antonio Ciocco 
University of Pittsburgh 
Pittsburgh 13, Pennsylvania 


Dr. Wilfrid J. Dixon 
Medical Center, U.C.L.A. 
Los Angeles 24, California 


Dr. W. T. Federer 
Cornell University 
Ithaca, New York 


Dr. John W. Fertig 
Columbia University 
New York 32, New York 


Dr. Franklin A. Graybill 
Colorado State University 
Fort Collins, Colorado 


Dr. Bernard G. Greenberg 
University of North Carolina 
Chapel Hill, North Carolina 


Drs. John Gurland and T. A. Bancroft 
Iowa State University 
Ames, Iowa 


Dr. Boyd Harshbarger 
Virginia Polytechnic Institute 
Blacksburg, Virginia 


Dr. Allyn Kimball 
Johns Hopkins University 
Baltimore 5, Maryland 


Dr. Schuyler G. Kohl 
State Univ. of N. Y., Col. of Med. 
Brooklyn 3, New York 


Dr. Robert F. Lewis 
Tulane University 
New Orleans 12, Louisiana 


Dr. Eugene Lukacs 
The Catholic University of America 
Washington, D. C. 


Dr. Paul Meier 
University of Chicago 
Chicage 37, Illinois 


Prof. Felix Moore 
University of Michigan 
Ann Arbor, Michigan 


Dr. Lincoln E. Moses 
Stanford Medical School 
Stanford, California 


Dr. Hugo Muench 
Harvard School of Public Health 
Boston 15, Massachusetts 


Dr. Robert Quinn 
Vanderbilt University 
Nashville 5, Tennessee 


Prof. J. A. Rigney 
North Carolina State College 
Raleigh, North Carolina 


Drs. W. W. Schottstaedt and James Hagans 


University of Oklahoma 
Oklahoma City 4, Oklahoma 


Dr. Malcolm E. Turner, Jr. 
Medical College of Virginia 
Richmond, Virginia 


Dr. Colin White 
Yale School of Medicine 
New Haven, Connecticut 


Dr. J. Yerushalmy 
University of California 
Berkeley 4, California 
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For those unable to train during the academic year, an unusual opportunity 
is provided by a cooperative GRADUATE SESSION OF STATISTICS IN THE 
HEALTH SCIENCES sponsored by these Program Directors and made possible 
by a training grant from the PHS, NIH. For information concerning available 
stipends and course offerings at elementary, intermediate, or advanced levels for 
the summers of 1961 and 1962, write Dr. Jacob E. Bearman, University of Minnesota, 
Minneapolis, Minnesota. 


FELLOWSHIPS FOR MATHEMATICS GRADUATES 


The Division of Preventive Medicine and the Institute of Industrial Health of 
the College of Medicine, The University of Cincinnati, with the support of the 
Public Health Service, are instituting a training program in Epidemiology and 
Biostatistics. As part of this program training and experience will be given in 
biology and research to bright, young, and prospective candidates for a Ph.D. 
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The Department of Statistics at Roswell Park Memorial Institute has been 
awarded a grant by the National Cancer Institute and in conjunction with the 
University of Buffalo offers a two-year graduate on-the-job training program leading 
to a Master of Science degree in biostatistics. 
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sciences who will begin their graduate work next fall are sought as applicants for 
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Roswell Park Memorial Institute. Trainees will receive an average annual stipend 
of $2300.00 plus tuition. 

Requests for further information and application forms should be addressed to 
Miss Barbara A. Cote, c/o Dr. Irwin D. J. Bross, Chief, Department of Statistics, 
Roswell Park Memorial Institute, 666 Elm Street, Buffalo 3, New York. 
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THE GROWTH AND AGE-DISTRIBUTION OF A 
POPULATION OF INSECTS 
UNDER UNIFORM CONDITIONS 


E, J. WILLIAMS 
Division of Mathematical Statistics, 
C.S.I.R.O., Canberra, Australia. 


I. INTRODUCTION 


This paper deals with the growth of populations of insects such as 
aphids that reproduce continuously under uniform conditions. Its 
object is to derive the relations among the different parameters of the 
populations: the birth rate, the death rates at different stages, the 
age-distribution, and the intrinsic rate of natural increase (Lotka 
[1945]). Interest in these relations arose from the need to estimate 
the size of aphid populations by indirect means; the work presented 
here is supplementary to that of Hughes [1961], who was responsible 
for initiating the experimental investigations and who saw the possi- 
bility of estimating the rate of growth of an aphid population by means 
of a study of its age-distribution. 

Insects pass through a number of immature stages (instars) which 
may be readily distinguished. If the average duration of each instar 
is known, a count of the numbers of insects in each provides a con- 
veniently grouped age-distribution. By studying this age-distribution 
at successive intervals we can estimate the way in which the popula- 
tion is developing and thus indirectly determine the rate of growth 
of the population. 

Although the original investigation was of an aphid population, 
the problem will be framed in terms sufficiently general to apply to 
several types of insect with similar pattern of development. 


Il. FORMULATION OF THE PROBLEM 


We shall set out in rather general terms the conditions under which 
the population is developing, so that the results may be applied to 


different types of insect. 
We consider an insect with k instars, and designate the parameters 


of the population as follows: 
349 
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birth rate of adults’, 


By —— 
u; = death rate during the 2th instar, 
us = death rate of adults. 


For simplicity these parameters will all be assumed constant within 
the stage of growth (immature or adult) to which they apply. 

We denote by n;(t) the number in the ith instar at time 1, and by 
na(t) the number of adults at time ¢. 

We shall assume that the durations of successive instars are in- 
dependently distributed; although it could have been assumed that 
the lengths of successive instars for any insect are negatively correlated, 
so that an instar of unusually long duration will be partly compensated 
by a short one, the evidence for this assumption is not convincing. 

We shall denote by x; the duration of the 7th instar for some insect, 
and by X, the duration of the first 7 instars, so that 


DG ee AE EI 


Both 2, and X, are random variables, whose probability densities 
we shall denote by f;(z;) and f'(X;), respectively. Under the assump- 
tion of independence, the density f'(X;) is the z-fold convolution of 
the densities for the first 2 instars. 

We denote the corresponding distribution functions by F with 
appropriate affixes; thus 


E(z;) 


/ “ f.lu) du, 
0 
and 


F'(X;) 


ie f'(u) du. 


Since under uniform conditions the population will be increasing 
or decreasing at a constant rate, we shall find it convenient to express 
the population size in terms of the Laplace transforms of the various 
probability densities. We shall write 


gi(7) = ie e ”” f.(u) du. 


The Laplace transforms for the convolution distributions are simply 
products of those for successive instar distributions. 
The Laplace transform plays an important part in many biological 


1This is to be distinguished from the birth rate of the population as a whole, which is customarily 
denoted by \ (see Hughes [1961]). 
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problems. An interpretation of its biological applications has been 
given by Cox [1957]. 

The distribution of durations among survivors in any instar will be 
affected by the death rate. For the 2th instar with death rate p; , 
the density of a duration 2; will be proportional to 


ent: f:(x;) 
and will thus be given by 
gi(ai) = e** fis) /bi(us)- 


We shall now consider the age-distribution, and derive thence the 
rate of growth of the population. In these populations under favourable 
conditions, the rate of increase is high and mortality has only a small 
effect. We therefore first consider how the population would grow if 
there were no mortality. 


Ill. POPULATION GROWTH WITH NO MORTALITY 
The chance that an insect is in the 7th instar at a time ¢ after birth is 
(NO @ rie Cay) ee 0) OY 
likewise, the chance that it is an adult is 
Bee eG) vy, 
Now the number of births in an interval (u, w + du) is 
yna4(u) du + o(du), 


so the number expected in the 7th instar at time ¢ is 
nt) = 7 fi n,(u[Fo(t — u) — F(t — w)] du. 

We have in particular for the adult stage the result 

BO / * nal) PME — u) du. (1) 
Being of homogeneous type, the integral equation (1) has the solution 

malt) = na(O)e”', 
where p is the solution of the equation 
g = ¥ fe ef F(t — u) du, 

or 


: = | e * F'(u) du = 1 bib is Din (2) 
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where we have suppressed the argument of the Laplace transforms. 
This equation corresponds to the one given by Lotka [1925], p. 115, 
equation (25). 

In equation (2) the function F"(u) represents the probability that 
an individual is a member of the adult population at a time wu after 
birth. It thus corresponds to Lotka’s p(a) (the probability of survival 
at time a after birth), although p(a) is a decreasing function of a, 
whereas /’"(X) is an increasing function of X. The modes of develop- 
ment of the two populations are quite different: the individuals of 
one are subject to mortality, whereas the individuals of the other are 
subject to delay in reaching maturity. 

We can express the numbers of individuals in each instar in terms 
of the Laplace transforms with argument p. 


n(t) 


na) [MER = F'Qu)] du 


= 5 MaO)e"'bids mtn yin area a 


The total number of all stages at time ¢ is 
: na(O)e". (3) 


The fraction of the population in the 7th instar is 
gids °° $i-1(1 en $i), (4) 


and the fraction in the adult stage is 


Gido -** dy = (4A) 
independently of t. 

This exponential solution agrees with physical considerations, and 
describes the steadily increasing population to be expected when con- 
ditions have been constant for a long period. The development of the 
population is described by the constant growth rate p. 

If, as a result of changing conditions, the initial age-distribution 
is different from that given by the solutions (4), the solution for the 
distribution between instars will be more complicated. However, it 
is convenient in practice to confine observations to steadily increasing 
populations under uniform conditions; the behaviour of such popula- 
tions is fairly easily interpreted. 

We can also find the distribution of durations in any instar (the 
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instar age-distribution’), which is of some theoretical interest. In 
the steadily increasing population the number of individuals entering 
the ith instar a time x previous to observation is proportional to 


—px 


€ 


The number of individuals of instar-age 2 in the 7th instar is the number 
which have entered a time x ago, reduced by the factor 

Moe e 2 (x), 
for those which have left the 7th instar. Hence the probability density 
of instar-ages in the 7th instar is 


e [1 — F,(x)] pee) 5) 
i ges = F,(w)] du Li ¢:(p) 
0 
and the density of adult ages is 
pee (5A) 


The expressions (4) and (5) between them completely specify the 
age-distribution in a steadily increasing population without mortality. 
If we know the age-distribution between instars, we can in principle 
equate the expressions (4) to the known probabilities to determine 
the rate of growth of the population. 


IV. EFFECT OF MORTALITY ON POPULATION GROWTH 


When the effect of mortality is taken into account, we have to 
allow for the chance that the individual will die before reaching the 
snstar considered. Also, since we are assuming a different mortality 
in each developmental stage, a somewhat more elaborate analysis 1s 
required than was given in Section ILI. 

We assume that the net rate of growth of the population is p, and 
consider the movement of individuals through the ith instar. Since 
the death rate is u, , the number of individuals entering the 7th instar 
at a time x previous to observation and surviving is proportional to 


—(ptpidz 
€ er 


so that, as in Section III, the density of instar-ages is 


(o + went — FG] re 
hoe di(p =F Li) 
and the density of adult ages is 
(p ae ven oats (6A) 
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The effect of mortality in the 7th instar is thus to replace p by 
p + u, in the distribution of instar-ages. In general it can be seen that 
the effect of mortality is to replace p by p + yu; in all functions of the 
7th instar distribution. 

The number of individuals going from instar 7 — 1 to instar 7 in 
an interval (¢, ¢ + dt) is 


Mi-s(t) di(p + ui-1) 
eee Mi-1) 


me eg du + o(dt) 
0 


fs (p oe Mi-1)bs-1(p “1s Mi-1) dt 
a nia(t) g | ee: $i-1(p ss Mi-1) ws S )- 


Similarly the number going from instar 7 to instar 7 + 1 is 


nf) dt BEMIS + (an, 


where the argument of the Laplace transform has been suppressed 
since it is indicated by the subscript. 
Also the number dying is 


n,(t) dtu; + o(di). 


Hence, since the rate of population growth is p, we have 


Oe ie (ee @ 
and 
na(D(o + wa) = ny(d) OE ae (7A) 


Taking into account the birth rate y, we have 


n,(t) eae 


Thus the numbers in each instar can be expressed in terms of the number 
of adults living at the same time: 


ae yna(d) : 


nf) = yna(f) 82S — 40), ©) 


Similarly, from (7A) we have 


nat) = yn,() 


giving the equation satisfied by the growth rate: 
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Rarer eg a ier as (9) 


which corresponds to (2). 

It is convenient to express equation (9) explicitly as an integral 
equation similar to (2). In this integral equation the densities g, of 
durations adjusted for death rate replace the original densities f; . 

The basic relation for this representation is 


PO eye if en HE Cu) du, 


a os(us) | e “g:(u) du, 


= :(u:) v.(p), 


where y;(p) is the Laplace transform of the density g;(z;). This rela- 
tion expresses the Laplace transforms appearing in (9) as the product 
of two transforms, one depending on the death rate for the instar, the 
other on the growth rate of the population. 

Thus equation (9) is explicitly 


OER eh neem) 


= 1 (H1)b2(ue2) a Pi (ux) ¥:(p) Pop) ie i ¥(p) (10) 
= (1) bo( He) Pee bi (ux) IP e ™g*(u) du. 


Finally, equation (10) can be expressed in a manner similar to Lotka’s 
equation if we observe that, Y4(p) = wa/(p + ua). 


The equation then becomes 


MA = b.(us)balun) + dele) [| eg") du, (11) 
of 0 


where g*(X4) is the distribution of the total life span. Equation (2) 
clearly cannot be expressed in this way, since (11) would be meaning- 
less if there were no mortality. 

The reason why equations (10) and (11) differ from the original 
Lotka equation is that we have here assumed that the death rate is 
not directly time-dependent, but instar-dependent. Thus the rate of 
growth of the population depends on the distribution of durations of 
the different instars. When the death rate in each instar is the same 
(u; = u), equation (10) may be reduced to 
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asf ge Pree eMart. 
0 


21m 


which is comparable with (2) and with Lotka’s original equation. 


V. RESULTS FOR PARTICULAR INSTAR 

DURATION DISTRIBUTIONS 
The results of Sections III and IV are intractable unless some as- 
sumption is made about the distribution of the durations of the instars. 
We shall here consider two simple distributions which may be suitable 
approximations and may indicate the form that the results will take 
in general. We give the results when there is mortality, from which 

the results for no mortality can be easily deduced. 


(a) Constant instar length 
If the length of the 7th instar is a constant x; , the results take a 
fairly simple form. 
Firstly, 
$;(7) = Cniee 
The probability density of instar-ages in the 7th instar 


eae fetes 


Zann 
1 Sen eS 


(x < Sep 


= 0 (o> xX): 
The growth rate of the population is defined by 


as Baas : 
co exp — 2a (e+ mix ; 


and the number in the 7th instar is 


a -: Ce) exp = Ds (p me xs} 
Pick es | 


It is often reasonable to assume that death rate is independent of 
age. When this is done, the equations reduce to a simple form, which 
is found to give results corresponding closely with observation for 
many aphid populations. 

Putting uw; = pw, we have 


i-l1 
(1 _ Gee) exp {-( + L) ys xs} 
h=1 


n,(t) = ynald) Tay é 


n(t) = yna(t) 
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The growth rate of the population is given by 


2 = exp {~( + p) > ah. 


It is seen that, if the death rate is increased, the actual growth rate is 
decreased by the same amount. 

Hughes [1961] has worked with the assumption of constant death 
rate. He defines the sum p + uw as the potential rate of natural in- 
crease of the populations. The justification for this definition is that, 
if all causes of death could be removed, the rate of growth of the popula- 
tion would in fact be p + u. 


(b) Exponential distribution 
We here assume that the duration of the 7th instar is exponentially 
distributed with mean x; ; 


if —i/Xi 
ANCE )) Ae, 5 
fie) = = 


1 


The Laplace transform here takes the simple form 


(7) = AL aa) 


Then the distribution of instar-ages is given by 


1 oe (p ae Mix: Bo tert X Se 
d 
Xi 


which is of the same form as the distribution of durations, but with 
mean 


Xi 


Dap xs 
The growth rate of the population is given by 


hl Se 1 
i Tl + © + wl 


h=1 


‘an algebraic equation of degree k + 1. The number in the 7th instar is 


n,() =a ynal))xi 
I] [1 + (op + p)xa] 
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EMPIRICAL SAMPLING ESTIMATES OF 
GENETIC CORRELATIONS 


L. D. VANVLECK AND C. R. HENDERSON 
Cornell University, Ithaca, New York, U.S.A. 


Two methods of estimating the genetic correlation between two 
traits are commonly used. One procedure utilizes estimates of within 
subclass covariances as described by Hazel [1943]. An example is given 
by the analysis of observations on parent-offspring pairs. The other 
method uses estimates of the group variance and covariance components 
for two characters from an analysis within and between groups of 
relatives. The between and within analysis of sib or half-sib groups 
is an example of this type of analysis. The sampling variances of the 
estimates obtained in either of these ways has not been widely investi- 
gated. Robertson [1959] has derived an estimate of the variance of 
the genetic correlation estimated from the variance and covariance 
analysis of groups of relatives with observations on two variables. 
His procedure deals with the special case in which both traits have the 
same heritability and in which all subclass numbers are equal. ‘Tallis 
[1959] has given a general solution when subclass numbers are equal 
for estimates obtained from a between and within analysis of related 
groups. A general solution is, also, described by Mode and Robinson 
[1959] for genetic and genotypic correlations estimated from components 
of variance estimated from a four-way nested classification random 
model for the equal subclass numbers situation. For the parent- 
offspring method of estimation Reeve [1955] has given approximate 
formulae for the variance of the estimate. Apparently none of these 
procedures has been tested by empirical sampling. The purpose of 
this paper is to describe a procedure for obtaining empirical sampling 
estimates of genetic correlations in an attempt to learn something of 
the sampling variances of the estimates obtained from the parent- 
offspring analysis. Sampling variances of the empirical sampling 
estimates are then compared with the theoretical variances derived 


by Reeve [1955]. 
SAMPLING PROCEDURE 


Let us first consider a sampling scheme for the one-way classification 
model which may be extended easily to more complex situations. 


359 
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Suppose the model we are sampling is 
Yi = pa; + oy (C= Lose yep = ly eee. 
where 


u is the underlying population mean, 

a; is a random effect associated with the 7th class, and 

6,;; is a random effect associated with the jth observation 
in the 7th class. 


Further assume that the a; and 6;; are NID (0, o2) and NID (0, a3), 
respectively. Then the variance of Y;; isa; + 0;. To generate samples 
from such a population we need to draw samples from a NID (0, 1) 
population, multiply these random normal deviates by the appropriate 
standard deviations, and add to these products the population mean. 
Now Y;; = » + ¢,a, + o3e€;; , where the a; and e;; are NID (0, 1). 
If we take Y‘; = Y;; — uw, we have Y/; = o,a; + o:e;; . The sums of 
squares normally computed in terms of our sampling model are: 


Cc ni c 


De » bas > >: (o.4; i os€: i)” 


g=1 7=1 i=1 


c c 
2 2 
Ge) Myay ave, Ss aie:. + 03 SS bey 
1=1 7=1 3 


c (Y Ne f c 3 c : c Ps 
iy Sine + 20205 ares ais aT 
i=1 Ni; a i21 i=1 1; 


and 


Var 1 c 2 c 
ge a ee | ee( nas) sd 20401 neas)(e.) + oie |, 


(The usual dot notation signifies summation over that subscript.) 
It is apparent that in order to generate a sample with given subclass 
numbers it is necessary to compute only six terms 


c c c ni c 2 c 
(x n.d; , »s Glin, Dy Dee De a Deas eand e.) 
i= i= #=1 7=1 i=1 


i=1 Nn; 


which are functions of the random normal deviates and subclass num- 
bers. In order to complete the computation of the sampling sums of 
squares these functions can be multiplied by the corresponding constants 
which are of , 5 , and 2c,0; and added accordingly. Thus functions 
of deviates and subclass numbers can be generated and different sets of 
parameter values used with them in order to determine the effect 
of different ratios of the population variances on the sampling estimates 
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of the variance components. This procedure can substantially reduce 
the amount of sampling time required. It should be noted that such 
estimates will be correlated, but usually this will not be a disadvantage 
and actually may be of interest. Extension of such a procedure to a 
multiple classification is apparent. For the two-way classification with 
interaction 28 terms would be generated. 

A similar procedure may be used to generate estimates of genetic 
correlations. We are actually concerned with a four-variate model 
(2 traits on the parent and 2 on the offspring). For four-tuple samples 
of size N let the four-variate sampling model be: 


Xi; = Ares 5 

Xe Ase. a Aces, 

Xs: = AsCiz T+ AsOos 1 Acs: » 

Xap = Arerz + Aslos + AvCss PF Ar0las 5 


where the e’s are random normal deviates NID (0, 1) and the 2’s are 
constants determined by the variance-covariance matrix of the X’s, 


O14 G24 O34 G4 


The X’s can be arranged in such a way that two of them refer to a 
pair of traits on a parent and two of them to the same pair of traits on 
its offspring. In this paper let us consider the situation described by 


Table 1. 


TABLE 1 
ASSIGNMENT OF VARIABLES 


Trait Parent Offspring 
Ul Xi X3 
Il Xe Xs 


According to our model: 
eat oc =M+rutAs, 
g=N+M, % MB AEH AG + Ato » 


I 
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Fin = Airz y Go3 = Aodg + AsAs 5 
O13 = ira , O24 = AoAz + AzArsg , 
O14 = Aidz y O34 = AgAz + Asdg + Ayorg - 
Solving for the ’s in terms of the variances and covariances we obtain: 
M = 1, 
ee te On 
As = [02 — (o%2/01)]"”, 
Ni ==) Gin One 
As = (o23 — F12013/01)/[o2 nee (o42/0%)]”” 
de = [os — (¢is/a1) — (028 — or201s/01)"*/(o3 — (o12)*/o3)]"” 
N=" O44) Fx 5 
As = (24 | G1s014/ 0a) es = Gata 
No = {o4 — o1s014/0; — (023 — O12013/04) 


(04 ne 612044/03)/[o> ee (ois) foal} /{e; a (ois) (a; 


= (23 a 012013/'03) /[o> Ta CGA. 


and 
Aro = (a4 a (Gia) on ae (O24 <7 O12014/ 01) /|o3 aa (o12)"/oi] 
— {o34 — Gratia/or — (23 — 042043/0%) 
(G24 — o12014/01)/[o2 — (o12)"/oi]}*/[o3 — (o13)?/o% 
= (a359— 012013/0;) /[o% aa (012) /oi])'”. 


Next let us define the following quadratics of the normal deviates: 


N 2 N 
2 al C1 .€ 
pas Peg ee ie = Pa ere oa 
2 13 3 
i v N ? yD 171937 N 5) 
N e N 
D (AM 
L, = 2,6; — = Ing = 27 61:4; — 2 
p> a N ) 14 > 1i%47 N 5) 
N oe N E 
i os (pee Ie: 2.63. 
dX v N d 23 3 21937 N ? 
Fa Fee = tt 
iby = e Aes ine €2 C4. 
‘ Py Cs€4.0— 
re N ) 4 a 21047 N ) 
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Using these definitions and the procedure described previously, the 
empirical sampling variance and covariance estimates with (VN — 1) 
degrees of freedom are the following functions of )’s, normal deviates, 
and N: 


éi = Oily) /W — 1), 

62 = (IL, + OAGL2 + DWsdrsLa2)/(N — 1), 

62 = [A2Ly + Le + Ls + WAarsLa2 + Adela + Aerdelies) | / IY — 

62 = [\2L, + NLe + AGLs + Mola + 2ArreLae + Avolaz + ArAroL4 
L hedoline f daMiclas + Movs) //(NV = Y, 

bra = aideliy FH MAslie)/(N — 1), 

bra = (Aidala + MsAsLs + ArdeLas)/W — 1), 

bse = adel + MadeLiz + MdoLas + rola) /(N — 1s 

boa = [Nadaln -E AsdsLe + Cds + dada) Lae 
BD Lae Apel) C= DG 

do = [NadrLa + dads + (dads + Aade) Laz + Aodolns SES aah 
SO peNs hanna eel), 

and 

bea = [deli + AsdsLn + AodoLs + sds + Asdz) Lia + (Ado + AeA7) Lis 
4. (Ngho + Aede)Las + Asdrolrs + Asdiobas + Na Nielal/ (Neel) 


These sample estimates may be used to construct estimates of 
genetic parameters which have known population values. The herita- 
bility of trait J may be estimated by 


hz = 2613/6163 
and of trait JJ by 
Niz = 2624/6264. 
The genetic correlation is estimated from residual covariance com- 


ponents according to Hazel [1948] by four methods: 


Op a Gag (Cuda) 6 Op = EGR ee 


(G14 = ey irctanr aan and a (Crt 


{| 


Ys 
It should be noted that the genetic model described here assumes & 
random mating population where gene effects approximate to the four- 
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variate normal distribution. And, of course, if selection is applied to 
the population then the parameters will probably change. This model 
also does not consider effects due to linkage, sexual or cytoplasmic 
differences. 


SAMPLING RESULTS 


Samples were generated with N = 100 for the four-variate sampling 
model which has been described. Twelve hundred sets of the L’s were 
obtained with 99 degrees of freedom for each set. These 1200 sampling 
coefficients were then combined to form 120 sets of L’s with 990 degrees 
of freedom and 240 sets with 495 degrees of freedom. Then 24 sets of pa- 
rameter values (see Table 2) were used with the sampling coefficients to 
construct sample estimates of the genetic correlations (9; , go , gs , and gs). 
The true genetic correlation, go , and the heritabilities of the two traits, 
h; and hj, , for each parameter set are shown in Table 2. 

The parameter values were assigned as follows: nine parameter 
sets (1-8, 7-9, and 13-15) are constructed so that the two traits have 
equal variances and heritabilities; nine other parameter sets (16-24) 
are constructed so that the variances of trait IZ are half the variances 
of trait J and the heritabilities of the two traits are equal; and the 
remaining six parameter sets (4-6 and 10-12) are constructed so that 
the two traits have equal variances but unequal heritabilities. Three 
values were chosen for heritabilities and genetic correlations, 0.2, 0.5, 
and 0.8 with the exception of sets 19 and 20 where the genetic correla- 
tions were .14 and .35. 

The genetic interpretation of parameter sets 13, 15, and 22 is not 
possible. Inadvertently, covariances used in these sets were assigned 
which do not admit explanation by the usual genetic theory in that 
the environmental covariances between traits I and IZ exceed the 
environmental variances of traits J and IZ. Another inconsistency 
occurs in the parameter sets 4-6 and 10-12 where the phenotypic 
correlations between traits J and IJ are different for parent and off- 
spring thus implying different environmental correlations. These 
differences were not intended but they do not invalidate comparisons 
with the theoretical variances since the derived estimates of the sam- 
pling variances depend only on a four variate distribution not necessarily 
subject to genetic interpretation. These inconsistencies should, how- 
ever, be noted. 

The means of the sample estimates are presented in Table 3. Esti- 
mates were discarded if the denominator of the estimate had a negative 
component or if the signs of the numerator components of gs were 
different. If the signs of the numerator components of g, were both 
negative, the sign of g, was considered to be negative. 
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TABLE 2 
PARAMETER VALUES UsEep IN ConstrructinG HsTIMATES 
or GENETIC CORRELATIONS 


Parameter 


a 2 2 2 2 
Set hy® hyy® goo Co LOS Uae ECT O12 O13 O14 O23 O24 O34 


1 20 .20 .20 | 1000 1000 1000 1000 | 100 100 20 20 100 100 
2 20 .20 .50 | 1000 1000 1000 1000 | 100 100 50 50 100 100 
3 20 .20 .80 | 1000 1000 1000 1000 | 100 100 80 80 100 100 


4 | 50 .20 .20 | 1000 1000 1000 1000 | 250 250 32 32 100 100 
5 50 .20 .50 | 1000 1000 1000 1000 | 250 250 79 79 100 100 
6 50 .20 .80 | 1000 1000 1000 1000 | 250 250 126 126 100 100 


fi 50 .50 .20 | 1000 1000 1000 1000 | 250 250 50 50 250 250 
8 50 .50 .50 | 1000 1000 1000 1000 | 250 250 125 125 250 250 
£ 50 .50 .80 | 1000 1000 1000 1000 | 250 250 200 200 250 250 


10 50 .80 .20 | 1000 1000 1000 1000 | 250 250 63 63 400 400 
11 50 .80 .50 | 1000 1000 1000 1000 | 250 250 158 158 400 400 
12 50 .80 .80 | 1000 1000 1000 1000 | 250 250 253 253 400 400 
13 30 .80 .20 | 1000 1000 1000 1000 | 400 400 80 80 400 400 
14 80 .80 .50 | 1000 1000 1000 1000 | 400 400 200 200 400 400 
15 -80 .80 .80 | 1000 1000 1000 1000 | 400 400 320 320 400 400 
16 20 .20 .20 | 1000 500 1000 500 | 100 100 14 14 50 100 
17 20 .20 .50 | 1000 500 1000 500 | 100 100 35 35 50 100 
18 20 .20 .80 | 1000 500 1000 500 | 100 100 57 57 50 100 
19 50 .50 .14 | 1000 500 1000 500 | 250 250 25 25 125 250 
20 50 .50 .35 | 1000 500 1000 500 | 250 250 62 62 125 250 
21 50 .50 .50 | 1000 500 1000 500 | 250 250 88 88 125 250 
22 80 .80 .20 | 1000 500 1000 500 | 400 400 57 57 200 400 
23 80 .80 .50 | 1000 500 1000 500 | 400 400 141 141 200 400 
24. 80 .80 .80 | 1000 500 1000 500 | 400 400 226 226 200 400 


sHeritability of trait I. >Heritability of trait II. °Genetiec correlation between traits I and II. 


All four methods of estimation appear to provide unbiased estimates 
of the genetic correlation when the sample size is large. For small 
sample size (V = 100) and low heritability (.20) of at least one trait, 
however, the means presented in Table 3 indicate the estimates may be 
biased upwards. The bias apparently increases with an increase in the 
genetic correlation when heritability is fixed. It is also worth noting 
that the bias is larger in nearly every case for g, than for g; , J2 OF Js 
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TABLE 3 


MANS OF SAMPLE ESTIMATES OF GENETIC CORRELATIONS—ASSOCIATED 
WITH PARAMETERS IN TABLE 2. 


1200 Samples f = 99 | 240 Samples f = 495 | 120 Samples f = 990 
Parameter 
Set Oi Op is fi. G2 93 94 9. 92 93 Ga 
1 VP, PF PRB - Bhi PRE lee Ake PA) (20) 2116-189220 
2 ROOM OOM OL MAS hoy Gl A ey JH 2G 2) 60 
3 1.06 1.12 1.09 1.24 .95 .89 .92 .89 .84 .79 .82 .78 
4 Pal £56} giles Pal OPP ss OR OR Py oleh RU 8 
5 8 hil be oa oy, 0) ah 4 .51 .48 .49 .48 
6 58) 568) 02 Cb .84 .81 .83 .80 £82)578 207s 
ai silts) S0Ksy 5d Pal 220 ee See 20 220/718 219 219 
8 (oe 6) Be ER .50 .48 .49 .48 .50 .48 .49 .48 
9 PSSM SONS / moo mel) oH) ott) orts} .79 .78 .79 .78 
10 aikes 1 Sal Pa 19) SalS ee Lo eat9 LOR TS LOS 
11 J 0) Si a0 .50 .49 .50 .49 .50 .49 .49 .49 
12 LOOM OOM COMO! 280 79 S09 .80 .79 .79 .79 
13 wllsy GIS 31K Wi 2 Oe Oe Om ao .20 .19 .19 .19 
14 .50 .49 .49 .48 .50 .49 .49 .49 .50 .49 .49 .49 
15 door cel) oll 627 .80 .79 .80 .79 .80 .79 .79 .79 
16 PP, NS TS I sales ZS SG OPAL oalzé Ale! AG) al 
17 oe) Ol 0 20} .06. 152) .54 256 .50 .47 .49 .50 
18 109) 97 1203) 15 .94 .90 .92 .89 .84 .80 .82 .79 
19 10 08 08 10 mle Th Ty Sas pA Liles Lee 
20 sos fall) BP Se sad) 9a) gexsk 388) POON OOMOAEEOD 
21 202, 244 49 251 VAD oa aS a .50 .47 .48 .48 
22 oll, le oll 2X0) sil ale) say oe) 220,420) 520 219 
23 49 .47 .48 .48 .50 .49 .49 .49 .50 .49 .49 .49 
24 Oe Oumee SO TS ste) 7) 7) 78) .80 .79 .79 .79 


af is the number of degrees of freedom associated with each sample. 


when there is evidence of bias. This increased bias is probably caused 
by discarding estimates of g, when the numerator covariances differ 
in sign since in these cases g; , g2 or gs would usually be small or negative. 

The sampling variances of the estimates of genetic correlations were 
computed as >", (g;; — g;.)’/(m; — 1) where m; is the number of 
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estimates for the jth estimation method. These sampling variances 
are shown in Table 4. 

Reeve [1955] gives approximate formulae for the large sample 
variances of the estimates g; and g,. The variances are equal for the 
two estimates. This variance is (in our notation): 


2 22 2 2.2 2 2 9 D) 9 2 
V(Q) a (es aL 0203 ae 0103 a 0204 012034 13924 401034 
== le ares ik 2 2 
: Af 14 023 013 O24 014093 F14093 G14913 
2 2 2 
201204 201203 209034 2012034 2014023 1); 
7 = rag am ’ 
014094 923013 023024 013024 0139024 


where f is the degrees of freedom associated with each component 
estimated. Using the same method as Reeve the variances of g, and gp 
can be derived as 


2 oun 22 22 2 2 1 
gi { F203 0103 0204 71203 G 2034 712034 014023 4 
Vigng= = oS Sy = = 9 ; Fr 9 
f 023 4o%3 404 923013 023024 013024 
and 
2 2.2 22 22 2 2 1 
_ G2 (9104 0103 G204 91034 T3204 012034 + O14023 
V (ge) = ee i a = _ 9 = 5) 
f O14 4043 4o04 014013 914024 013024 


The large sample variances of the genetic correlations corresponding 
to the 24 parameter sets are listed in Table 5. The expected variances 
of g, and g, are about double those of g; and g,. The expected variances 
of g, and g, are slightly different for parameter sets 4-6 and 10-12 for 
which the heritabilities of the two traits are different. 

The computed sampling variances of the estimates of g, and g, are 
approximately twice those of g; and g, as expected. The agreement 
between the expected and computed sampling variances of the estimates 
is poor for the small sample size (f = 99) except for the cases in which 
the heritability of both traits is high (0.80). The differences between 
computed and expected variances are much less for f = 495. The 
variances of estimates which still deviate most from the expected are 
those associated with low heritability (0.20) of both traits. For a 
relatively large sample size (f = 990) the agreement between expected 
and computed values is surprisingly close for all combinations of param- 
eters used in this study. 

The extremely large sampling variances which occurred with some 
parameter sets are probably due to large estimates in turn due to small 
values of one or both of the denominator covariances. This suggests 
that heritability plays a dominant role in determining the sampling 
variances of genetic correlation estimates. The coefficient of variation 
for one of the denominator covariances under the normal distribution 
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TABLE 5 


EXPECTED VARIANCES OF GENETIC CORRELATIONS 
FOR THE PARAMETERS OF TABLE 2.* 


i= f = 495 f = 990 
Parameter = 

Set Vir) Vigo) Vigs) | Vig) Vigs) Voge) | Von) Vig2) VCs) 
1 990 .990 .495 198 .198 .099 099 .099 .050 
2 1.036 1.036 .540 2207, £207 [08 104.104 .054 
3 1.174 1.174 .676 235 286 ©2185 S17 Me gals 
390  .879 .192 078 .076 .038 039 .038 .019 

5 414 .385 .206 083.077 .041 041 .038 .021 
6 490 .444 .272 098 .089 .054 049.044 027 
“ 148 .148 .078 030 .030 .016 015 .015 .008 
8 A42. 142 2070 028 .028 .014 014, .014 .007 
9 Asie Able 077 030.030 .015 015 .015 .008 
10 090 .089 .049 018 .018 .010 009 .009 .005 
11 083 .080 .040 017.016 .008 008 .008 .004 
12 088 .082 .041 018 .016 .008 009.008 .004 
13 054 .054 .033 O11 .011 .007 005 .005 .003 
14 046 046.023 009.009 .005 005.005  .002 
15 045 .045 .020 009 .009 .004 004.004 .002 
16 974 .974 .483 95 23195 .097 097.097 .048 
17 994 .994 .503 199 .199 .101 099.099 .050 
18 1-113. ic ite. 2620 (2980 2023 9 124 AIL) Ute 062 
19 147.147. ~.081 029 .029 .016 015 015 .008 
20 piBo 2132 = 4066 026.026.0138 013 2018.99.007 
21 126, “126. 059 025 .025 .012 013.013 .006 
22 050 .050 .034 010 .010 .007 005.005 .003 
23 037 .037 .019 007 .007 .004 004 .004 .002 
24 032  .032 .012 006 .006 .002 003.003 .001 


aThe expected variance of g4 is the same as that of gs. 


is 100-V/ (4 + h’)/ fh’ . Similarly we can define the average coefficient 
of variation for the two denominator covariances associated with the 
genetic correlation to be 


100 ( 4+ hj ae) 
Ae Aan So meh 
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This average coefficient of variation turns out to be useful as a “rule of 
thumb”’ for determining when the theory approximation of the variance 
of the genetic correlation is likely to be good. We can note that when 
the average coefficient of variation, A, is 20 percent or less the theory 
approximations in Table 5 are quite closely in agreement with the 
empirical sampling variances summarized in Table 4. 

This guide line would apply equally as well to estimates of genetic 
correlations as to their sampling variances. When the average coefficient 
of variation is larger than 20 percent then the estimates are likely to be 
biased upward. In fact, if prior knowledge indicates the average 
coefficient to be greater than 20 percent then some thought should be 
given to whether or not the genetic correlation should be estimated. 


CONCLUSIONS 


The results given in Table 4 indicate for sample sizes of 1,000 or 
more that the approximate formulae of Reeve [1955] for the large sample 
variance of the genetic correlation are accurate. For smaller sample 
sizes (500 or less) the approximations are not accurate unless the 
heritabilities of the examined traits are high. When the sample size 
is 100 or less the approximations may tend to be very misleading. 

It seems safe to say that investigators who are estimating genetic 
correlations need at least 1,000 sets of observations in order to obtain 
reasonable estimates of the sampling variances of the estimates. Even 
then the sampling variances of the genetic correlation estimates may 
still be too large for the estimates to be of use especially if heritabilities 
of the traits are low, for example h” < 0.20. A useful guide may be 
the average coefficient of variation for the denominator covariances 
associated with the genetic correlation which depends on heritability. 
If this coefficient is 20 percent or less then the theory approximation 
may be quite good. 

Estimation by either of procedures g; or g, seems preferable since 
the sampling variances are only about half as great as for methods 
g. OF gx. Procedure g; appears to be better than g, especially for small 
sample sizes since the results have indicated g; is less likely to be biased 
than g, probably because more estimates of gs will usually be discarded 
as imaginary or uninterpretable. 


SUMMARY 


A method of generating samples from a normally distributed popula- 
tion is described which has the advantage of being parameter free. 
Sample coefficients can be developed with zero means and unit variances 
and then different sets of parameter values used with the sample 
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coefficients to develop sample sums of squares and crossproducts from 
populations with different parameters. This method is used to con- 
struct sample estimates of genetic correlations for 3 sample sizes and 
24 sets of parameter values. The sampling variances of these estimates 
are compared with those expected by application of Reeve’s [1955] 
formulae. 
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FOR THE LOGISTIC MODEL’ 


James HE. GRIzZLE 


Department of Biostatistics, School of Public Health 
University of North Carolina, Chapel Hill, North Carolina, U.S.A. 


1. INTRODUCTION 


Data collected in many types of research often consist of the 
proportion of experimental units having a specified attribute. In 
this paper we shall be concerned with the analysis of this type of data 
when it can be arranged in a multiway classification as for example, 
a factorial arrangement. An example of this type of data is given 
in Table 1. 

Several methods may be used in analyzing these data. Regardless 
of the method, a model relating the proportion having the attribute 
to the treatments must be assumed for a meaningful interpretation of 
the experimental results. The problem of the most appropriate model 
becomes particularly acute when some of the treatments are applied 
at several levels and it is desired to investigate the nature of treat- 
ment effects with regard to both main effects and interactions. If 
the sample sizes in the cells are equal, the observed proportions are 
often analyzed by the analysis of variance. A more desirable procedure 
is to analyze the arc sine transformation of the proportion. It is well 
known that this transformation stabilizes the variance if the sample 
sizes are equal and not too small, and, for a large class of data, it pro- 
vides a unit of measurement on which treatment effects are approxi- 
mately linear except at values of the proportion near zero or one. 

In bioassay both the logit and the probit transformations have a 
long history of use. With appropriate extensions these models can be 
used in analyzing data of the type being discussed. Since the propor- 
tion responding has been found to increase sigmoidally with increasing 
stimulus for many phenomena, these transformations are particularly 
effective in providing a scale on which treatment effects are linear. 
Dyke and Patterson [1952] gave an example of how the logit trans- 


1This research was sponsored by Cancer Chemotherapy National Service Center of the National 
Institutes of Health as part of a research contract. 
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formation can be used in analyzing a 2” factorial arrangement. Their 
method can be adapted to other arrangements by an obvious extension. 

The purpose of this paper is to present a new method of testing 
hypotheses when the logistic model is used. It is shown that estimates 
of the cell probabilities required to make certain tests can be used to 
obtain maximum likelihood (ML) estimates of the parameters in the 
assumed model with less computation than is commonly associated 
with ML estimation with models of this type. The approach used 
here was suggested by the work of Reiersgl [1954] and Mitra [1955]. 


2. NOTATION 


To avoid multiple subscripts we will number the cells? = 1, --- ,r. 
It will be assumed that the relationship between the treatments and 
the probability P; of a response is described by 


1 = [log, (P:/Q,)] = A8, & <r, 


where A is an r by k non-singular matrix of known constants determined 
by the model and the design of the experiment, 6 is a k-element column 
vector of unknown parameters, | is a k-element column vector, and 
So ee aoe 

The following additional notation will be used: 


n; = sample size in the 7-th cell, 

u; = number of responses observed, 

Bp = Ne abs, 

C;. is the 7-th row vector and 

C_, is the 7-th column vector of a matrix C. 


3. THEORY 


Mitra [1955] and Diamond [1958], proceeding along the same lines 
of proof as Cramér [1945], have given the mathematical properties 
that the model and the hypotheses must have for the test statistics 
to be asymptotically distributed as x’ when the null hypothesis is true, 
and for the existence of unique consistent estimates of the parameters 
in the model for samples from multinomial distributions. Their results 
are given as general forms with the model and the hypotheses being 
unspecified except for certain analytic conditions. It is easily demon- 
strated that the logistic model and tests of linear hypotheses have 
the required properties, Grizzle [1960]. 


3.1 Tests of Hypotheses. 


Let it be given that the logistic model fits the data except for chance 
deviation. If we wish to test the hypothesis 
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He 2 C103; Gab 
where rank C* = t,t < k <7, and «isa vector of preassigned constants, 
we construct a matrix C such that 

Cl = C*0 = ¢, (3.1.2) 


where C* = CA, if H, is true. Then using (3.1.2) as restraints on the 
likelihood, estimates of P; are obtained. 
The logarithm of the likelihood subject to the restraints 1s 

@ = constant + >> (u,; log P; + v; log QO) — A (Cle— =x); (3.1.3) 


where 2 is at X 1 vector of Lagrangian multipliers. 
Taking partial derivatives, equating them to zero and solving for 
P, , we find 
Pre Gn UC eee = ae (3.1.4) 
To complete the solution, 2 must be eliminated from (3.1.4). 


Let 1 be the vector of elements /; = log. (P;/Q;), where P, is defined 
by (8.1.4). Then (3.1.2) implies 


Clix, (3.1.5) 
or its antilog 
1g! (“ SEGING aero = male (3.1.6) 


can be solved for 2. Either of these sets of equations can be solved by 
the familiar Newton-Raphson method which will be given in Section 
4.1. In general, the form to use for ease of solution depends on ¢,;; . 


4, COMPUTATION 


4.1 General Solution. 


Since it is not possible to derive an explicit solution for 4 from 
(3.1.5) or (3.1.6), we resort to the Newton-Raphson method of solution. 
Thus we expand (3.1.5) about some guessed value of 2, A> say, neglect 
all derivatives higher than the first-order and solve the resulting equa- 
tion for AX = 2 — A -Expanding C1 about the point 2 , the first-order 
approximation to (3.1.5) is given by 


Cis —= (© ID) (OY AX Gis (4.1:1) 


where D is a diagonal matrix of elements 1/ (n,P,Q,), and 1, and Dy 
are 1 and D evaluated at 4). Therefore, 


AX =(COD3C) a Clare (4.1.2) 
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If, as in many problems encountered in practice, + = O, then 
= (C D.C’) “Cl, . (4.1.3) 


A more accurate solution can be computed by letting A = A + Ar 
become the 2, for a second iteration. The process is repeated until 
Ad is as small as desired. Then, if Ho is true, 


LENS OU bee, (4.1.4) 


where D, is taken from the last iteration, approaches the x’-distribution 
with ¢ degrees of freedom as the n; get large. 

There are two special cases for which computational formulas can 
be derived which do not involve matrix inversion. The first, the test 
with one degree of freedom, can be obtained in an obvious way from 
(4.1.2). The second is the test for homogeneity of a group of treatment 
effects. This is particularly important because many tests of inter- 
action can be put into this form. 


4.2 Computation for the Test of Homogeneity. 


For the method to be useful we must be able to write Hy as 
6,= 60 = --- = 6,. Insome problems this is easily done; for others 
the model may have to be reparameterized, and for some it is not 
possible. This eae can also be written in the form 6, — 6, = 
6, — 0, = 2+ = Ora — 6, = 080 that C implied by the test is non- 
singular. The Puen i be more clear if we renumber the /; to 
pecomeds ,t = 1-8-5497 = L775: 

Given that the saedel fits the ies ae test the hypothesis 


Hy: =0=°-:: = 4 


there must exist c;, such that 


ys Cul; = 6, 


ae (4.2.1) 
De Ceshes — 6, . 
Hence, 
Do Casha; - De Cusles - 0, 
» sili = X Cle; = 9, (4.2.2) 


cer sles4 = Decula = 0, 
A i 
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if H, is true. Using (4.2.2) as restraints on the likelihood in estimating 
the P;; , we find 


Pi, = (Wis — eesdd ney | 1 = Ih, Oe ,6— bg =) ) 8, (4.2.3) 
and 
t—I1 
Pi, (il, Hcy Sonal ee jr Lycee. 
i=1 
If we-let Do (2) Xe = —\, so that dois = 0, and substitute P,, 
into (4.2.1) we have 
Up Ca AT ye c= ae 
de log (250% ist tor = Osha) = mee C.; log eae): (4.2.4) 


Expanding (4.2.4) about trial values \;> of the \; , where ys Av = O; 
we have 


fhe <a 6A, 81 a ee fhe cd 6\,8; ) (42>) 
where 
If “ : es Uis Te: cuih) 
T; = a. c;; log (4 mE ey: 
te 1 1 ) 
pe yD a (— — €;;\i0 a Viz 1 Ci;Ai0 ; 
and 


OA; = Ai — Awo- 
Choose the 7-th and k-th members of (4.2.5) and solve for 6\,. Then 
de = Ty Se — TH Sp ONS Sie eee = yee 
Now )>.; 6\, = 0, and thus 
H-—= TS + OAS. Ss =20; 
where 
i= do Ts/ Ss and 1/Si= De IWS 6 

Therefore 

6A; = (T; — HS)/S8; . (4.2.6) 


The process is continued as in (4.1.3) until 6d; is sufficiently small. 
Then 


= Dae: ? 
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where S, , taken from the last iteration, is asymptotically distributed as 
x’ with t — 1 degrees of freedom if Ho is true. 

If we change (4.2.4) to a product by taking the antilog of the equa- 
tions, it represents a generalization of Norton’s [1945] extension of 
Bartlett’s [1935] test. The solution to the product form of (4.2.4) 
is given by 


OA; = (R; a; 1/ShA)/R;S; ) (Avzi0) 
where 
a eg” ax | 
lige I] (u CiNg ) Gu lerss., p 


S, and 1/S are as previously defined andh = Dy 1/(R;8;). For small 
values of c;; , (4.2.7) may be a more convenient computational form 
than (4.2.6) because tables do not have to be consulted. Also in the 
case of (4.1.3) with one degree of freedom the product form may be 
more convenient. The solution is 


Ihe Bees ee) (4.2.8) 
where 
Tie == Il (u; — Ciel 5 Ihe == II (v; + Cue ae 
a Ws c3./(ui — C1;do) and SS: = DS ci:/Q; + ¢1:Ao)- 


5. TESTING OF THE FIT OF THE MODEL 
AND FITTING THE MODEL 


Even though the significance level and the operating characteristics 
of the tests are not what they are presumed to be, most investigators 
test the agreement of the model with the data before proceeding to 
make tests on the parameters in the model. The procedures described 
in Sections 3 and 4 can be adapted for this purpose. 


5.1 Test of the Fit of the Model. 


The method can be made intuitively plausible by drawing an analogy 
with the analysis of variance. Recall that the residual or error sum of 
squares in the analysis of variance for a factorial experiment can be 
regarded as the sum of squares due to high-order interactions. Hence 
this sum of squares can be computed as the sum of squares associated 
with a set of contrasts. To obtain the proper test statistic choose C 
as the set of contrasts which would be interpreted as interaction among 
effects included in the model and interpreted as error, and then estimate 
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the P; subject to the restraint Cl = O. This is equivalent to requiring 
that there be no deviation from the model, which is the hypothesis 
one wants to test. Once C is ascertained, the estimation and testing 
proceed as previously outlined. The 2 computed in the process of 
making this test can be utilized to compute the ML estimate of 6 with- 
out the iteration required by conventional methods. 


5.2 Estimation of Parameters. 


To ascertain the relationship between 2 and the observed logit, 
let us expand J; about the point Pio , the value of P; if the data fit 
the model. Then, to a first-order approximation 


log. (P;/Q:) = log. (Pio/Qio) + Wine Po) /(P i0Qio) - 


Now C.,, = P; — Pi . Solving for log. (Po/Qio) and replacing 
P, and Q; by their estimates, u;/n; and v;/n; , we find 


log. (PO) = log, (u;/v:i) — (X'C.)/(Pi0Qio), 


where 
Pi = (u; = XC.) /N: : 


Therefore we see that log, (P:o/Qio) taken from the test for the fit 
of the model represents the working logit. 

Thus if we obtain estimates of the P; subject to a suitably chosen 
set of restraints we can use them in the equation, 


A/WA0 = A’WI*, 


where W is a diagonal matrix of elements n;P:oQ:9 and 1* has elements 
log. (Pso/Qio), to estimate @ without the iteration ordinarily associated 
with the solution of ML equations of this type. 

Although iteration is not required to compute 6 by this method, 
it is required for computing 4. By conventional methods a k X k 
matrix must be inverted for each iteration, but by this method a 
(r — k) X (r — k) matrix is inverted in each iteration in computing 
2 and ak X k matrix is inverted once in computing 6. Since usually 
r — k < kin problems of the type envisioned here, there is some saving 
in computation time through the use of this method. Furthermore 
for some experiments, 2 can be obtained by methods given in Section 
4.2 without inverting matrices. Or, as in the example that follows, 
we may need to compute 2 in the process of making the tests. 


5.3 Haample. 
Cochran [1954] gives the following data: 
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TABLE 1 
Data on NuMBER OF MorHERS WITH Previous INFANT LossEs 


No. of Mothers with 
Birth Order Losses No Losses Total 
Problems 20 82 102 
2 Controls 10 54 64 
Problems 26 41 67 
3-4 Controls 16 30 46 
Problems 20 22 49 
5+ Controls 14 23 37 


It is desired to compare the mothers of Baltimore school children 
who have been referred by their teachers as presenting behavioral 
problems to mothers of a comparable group of control children. For 
each mother it is recorded whether she had suffered any infant losses 
previous to the child in the study. 

The model assumed is 


a et oe a= 7p) = 0, 


where a, is the effect of problem or control depending on whether 
i = 1or2, and 8, is the effect of the j-th birth order, j = 1, 2, 3. 

First we will test the assumption that the model proposed fits the 
data. This may also be regarded as the test of the hypothesis of no 
a@-interaction. If there is no interaction 


ha — he = by — bn = ev — be - (S321) 

A convenient form of (5.3.1) to use as restraints in estimating the P,; is 
hi — le — ler + bee = 9, 

lor — leo — lar + Lae = 9. 


(5.3.2) 


The estimates are: 
-P,, = (20 — r1)/102, Po = (26 — r2)/67, Ps, = (27 +r + .)/49, 
P,. = (10 + A) /64 , Po = (16 + .)/46, Pa = (14 — A = 2a)/87- 


To complete the solution, we will use the antilog form. 
Let A, + Ae = —As , SO that \, + de + As = 0. Then the equations 
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(20 — as) (34 ~ sy ie -- aa e ~ a) a eo - a) (2 - bs 
82+ 2,/\10 +2,  \41 + r/\16 + AQ ~ \22 + dr3/\14 + As/’ 
must be solved for A; , A2 , 4s Subject to the restriction \; + A» + As = 0. 

The starting values of \,) are determined as follows: The objective 
of the iteration can be regarded as to make R, = R, = R; , since when 
this occurs all 64; = 0. Therefore for the first trial values only, A, , Re 
and R; need to be computed. The initial values of Aip = Avo = Aso = O 
can be used if there are no zeros in the success and failure classifications. 
If for this test on this set of data we start with all \;) = 0, five iterations 
are required for accuracy comparable to the solution given. Needless 
to say, some practice in choosing A,» can save computing time. 

After some preliminary examination of the type described, the 
values of 


No = —.d, Neon = —1.0, As0 Kae eS 


were chosen for use in the complete cycles of iteration. After the 
computation shown in Table 2 we find 


Ai = —.503, Ay = —1.218, As = 1.716. 


TABLE 2 
CoMPUTATION FOR TEST OF INTERACTION 


Iteration 1 Iteration 2 
Ry 1.443009 1.443009 
R. 1.395000 1.448979 
Rs 1.505148 1.444384 
Si . 184662 . 184662 
Se . 160962 .161549 
Ss . 192797 .192333 
1/( R181) 3.752782 3.752782 
1/(R2S2) 4.453510 4.286823 
1/( R383) 3.446054 3.599673 
1/Sh 1.443038 1.443789 
dA1 .000 — .003 
OA2 — .214 .001 
dA .214 .002 


Of course this is only the approximate solution. Computation was 
stopped at the second interation because 5A; were trivial, indicating 
that further iteration would have very little effect on the test statistic. 
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Then 
X? = NS, + AZS2 + ASS3 


= .851 


For a x-test with two degrees of freedom and .05-significance level, 
the critical region is given by X* > 5.991. Therefore we do not reject 
the hypothesis of no interaction. 

Now that we have some assurance that the model fits the data, 
we can proceed to test hypotheses about the parameters in the model. 
The objective of the study is to ascertain whether there is a difference 
between problems and controls. This may be stated as 


Hy 2a, — oe = 0: 
The restraint associated with this test is 
Lit ioe La Nes ae lo. — lyn = 0, 

which gives the equation 

(22 - a) (26 -- a\ (22 - \) ad G Zo aI 4. ae + \) 

Bo Se n/ \40 4) \22 4A — X/ 130 = 0/128 — 07, 
As a preliminary estimate of \, choose \y = 2. After two iterations we 
find \ = 2.188 and X? = 2.475. Therefore if we put the significance 
level at .05 we do not reject H, . It is interesting to note that the 
probability of observing an X 2 > 2.475 is approximately 12 percent 
if H, is true while the test Cochran [1954] suggests has probability of 


approximately 10 percent. 
Another hypothesis of interest is 


Hy : Bi = Bz = Bs , 


that is, equality of birth-order effects. To test this hypothesis we use 
the restraints 


lay tele = ber + len = Jar + ile , 
which can be written 
Li t he — ba — bee = 0, 
lor + lee — Isr — Ie = 0. 


Using the same technique as in the test for interaction, we obtain the 
equations 


a _ ny (3 - by _ 2 = aa)(18 = Bi _ ie - as) (18 — 4) 
8D da/ N54 Ea) NAL FE Aa/\BO $a 122 + Ad/\28 1 Dal 
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Starting with \io = —9, Axo = 5, Azo = 4, after three iterations we 
find that 
A, = —9.925, Ny, = Bare). dh; = 6.596; 
and X? with two degrees of freedom is 24.239. Therefore we reject Hy. 
Now that there is evidence that birth order has a significant effect 
it will be instructive, as an example of the technique for testing a single 
contrast, to ascertain the nature of the effect. For simplicity let us 
assume that the birth orders 2, 3 — 4, 5+ are approximately equally 
spaced. Two hypotheses of interest are: is the effect of birth order 
linear or does it require a second-degree polynomial equation to de- 
scribe the effect? 
This can be ascertained by testing 
Ay > Bi — Bs = 0, 
and 
Ho. : Bi — 282 + Ba = 0. 
The restrictions for estimating the P,;; are 
Ly = lie a sy a zo = 0 
and 
Ly ae lie tory Ql = 2los as sy Se Iso = 0 
for testing Ho, and H,, respectively. For the test of Ho; , the equation is 
(2 ~ ae = s) (22 = a\( + 
82 +r/\54+/ 22 — A/\28 — DI 
Starting with 4, = —7 in three iterations we find \ to be —7.535. 


The computations as given by (4.2.8) are shown in the following 
table. 


TABLE 3 
COMPUTATION FOR TEST OF A SINGLE CONTRAST 


Iteration 1 Iteration 2 Iteration 3 
Ri . 130213 . 1389294 . 139544 
R, . 160919 . 140070 .189537 
Si . 130470 . 128352 . 128297 
Se . 260673 . 272320 . 272661 
5X — .521 —.014 .000 
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X? = 07°(8, + S,) = 22.765. 
This is far beyond the .05-point for x° with one degree of freedom. 
Therefore we reject Ho: . 
For Ho. the equation is 


(2 - A) (20 - (22 - x) (i = S _ ie + 2n\(28 ap 2) 
82 + r/\54 + 2/122 + /\28 + J — \41 — 2d/ \80 — 207 | 
Starting with \, = —1, after two iterations we find \ to be — 1.191 and 
X? = 1.478. Therefore we do not reject Ho. and we conclude that 
within the range covered by the data, birth-order effects are linear on 
the logistic scale. 

For the data presented here for illustrative purposes, it is not par- 
ticularly helpful to estimate the parameters in the model. However, 
we will proceed with the estimation to illustrate the non-iterative 


procedure for estimating parameters. The P,,; obtained under the 
no-interaction hypothesis are 


Py — .2010, ps 


4062, Pa 


.5160, 


I 


Pe 1488, 9 P3215, Be = 4248, 
and the associated /*; are 

* = —1.380, Li — 350; ix = .064, 

ig — —1.750, Es = — .747, (es = — 302. 


The model can be reparameterized in several ways to make the 
equations non-singular. One way is to let aw = a — &,m = Bi — fo, 
ne = Bi — Bs, then 


aN ie 
iret lees es 
ie icles! 0} 
dee 0 
Pur ieeeOne 4 
ey Se ees et 
and the normal equations are 
71.93772 —17.62000 — 1.72980 3.18720 || » | —52.3495 
—17.62000  71.93772 — 2.17140 — 5.10068) a | _ 3.5944 | 
— 1.72980 — 2.17140 50.65988  24.46504 || m == 93 122 
3.18720 — 5.10068  24.46504 45.74288\.mJ | —34.8062 
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The solution is 
w= —.749, a = —.184, m = —.-185, nN. = —.6380, 


and the variances of the estimates are .0149, .0149, .0267 and .0298 
respectively. The predicted logits using these estimates are 


ae — —1.381, esi = — .380, ee = .064, 
w= —1:748, Ls = —./48, is = —.303, 


which are very close to those used as the original estimates. Unless 
more then three-place accuracy is desired no further iteration is 
necessary. 


6. SUMMARY AND DISCUSSION 


In this paper we have developed new methods for making tests on 
the parameters in the logistic model. For many types of data the 
methods developed here allow computation of X” for tests of hypotheses 
about some subset of the parameters in the model without having to 
estimate them all as is usually done when using the logistic model. 
This might be particularly useful in combining 2 X 2 tables, as in the 
test of problems versus controls in the example given, which can be 
regarded as three 2 X 2 tables. If all that we desire to do is to test 
the difference between problems and controls, we need to do only that 
part of the work done for testing a; — a, = 0 in the example, if we 
can assume that the logistic model fits the data. 

Once 2 associated with a properly chosen C-matrix has been de- 
termined, it can be used to compute the ML estimate of the param- 
eters in the model without further iteration. The relationship between 
2 and the working logit and between the logistic model and Bartlett’s 
test of interaction is shown. 
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A GENERALIZED MODEL OF A HOST-PATHOGEN SYSTEM 


C. J. MopE 


Department of Mathematics, Montana State College 
Bozeman, Montana, U.S.A. 


INTRODUCTION 


This paper is a sequel to a paper, “‘A Model of a Host-pathogen 
System with Particular Reference to the Rusts of Cereals’, which 
appeared in Bzometrical Genetics [1960]. The former paper was re- 
stricted to the summer stage of the rusts of cereals, but in this paper 
a wider class of host-pathogen systems will be considered and some 
assumptions will be relaxed. Specifically, the assumptions of random 
association of the host and pathogen, equal number of varieties of the 
host and races of the pathogen, and constant fitness functions will be 
dropped. 

The results of this paper are intended to apply to host-pathogen 
systems satisfying the following conditions: 

1. The pathogen reproduces on the host. 

2. The host may be differentiated into varieties on the basis of 
its resistance to the races of the pathogen. 

3. The pathogen may be differentiated into races on the basis of 
its ability to grow on a set of host varieties. 

4. Host resistance to a particular race of the pathogen is genetically 
controlled. 

5. The damage to the host caused by the pathogen in a given time 
interval is directly related to the increase in number in the pathogen 
population during the given time interval. 

It should be pointed out that conditions (1) and (4) imply that 
no assumptions are made with respect to the mode of reproduction of 
the pathogen and the mode of inheritance of host resistance to the 
pathogen. 

Many economic crop plants and their foliar diseases caused by 
pathogenic fungi are examples of host-pathogen systems satisfying the 
above conditions. These host-pathogen systems are characterized by 
frequent shifts in the racial frequencies of the pathogen population, 
making it difficult to maintain host resistance to the pathogen. It 
seems plausible that damage to the host in such host-pathogen systems 
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could be adequately controlled if (1) the racial structure (the relative 
frequencies of the races) in the pathogen population were stabilized 
and (2) the increases in number in the pathogen population was held 
below some critical number during the growing season of the crop. 

In a previous paper, Mode [1960], the writer was unable to find 
conditions under which the racial structure of the pathogen popula- 
tion was stabilized. The purpose of this paper is two-fold, namely, (1) 
to characterize a host-pathogen system containing an arbitrary number 
of varieties of the host and an arbitrary number of races of the pathogen 
and (2) to find some conditions under which condition (1) of the previous 
paragraph is met. It will be assumed that the host-pathogen systems 
under consideration may be characterized in terms of continuous and 
differentiable functions of time. 

In passing it is interesting to note that in at least one case, the 
genetics of both the host and the pathogen has been worked out. The 
reader is referred to the remarkable paper of Flor [1956] in which the 
complementary genetic systems of flax and flax rust are discussed. 
As an example of experimental work in the field under consideration, 
the reader is referred to the recent paper of Suneson [1960]. 


1. POPULATION NUMBERS AND THE ASSOCIATION 
OF THE HOST AND PATHOGEN 

Let H be the number of members of the host population and let 
H; be the number of the members belonging to variety V;(¢ = 1, --- , m) 
at time ¢t. Similarly, let P be the number of members of the pathogen 
population and P; the number of these members belonging to race 
R;G = 1, ---,n) attimet. The relations of the H; and P; to H and P 
are given by 


H oe SH, and P a Se ° CLaD) 
i=1 j=l 


Let a member h of the host population and a member 7p of the 
pathogen population be chosen at random. The probability that h 
belongs to variety V; is 


Pr (he V;) = H/A = or, G2) 
and the probability that p belongs to race R; is 
Pry ett, = Fj) P= yf; (1.3) 


We shall consider next the association of the host and pathogen. 
Let v,; be the probability that at time ¢ a member h of the host popula- 
tion belonging to variety V; and a member p of the pathogen population 
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belonging to race FR, are associated. In symbols we may write 
Pr(he V; ,pek;) = Gi - (1.4) 
If the host and pathogen are associated at random, then we have 
independence in the probability sense so that 
Gis = ViY; » (1.5) 


for all 7 and 7. 

In general, however, we would expect non-random association of the 
host and pathogen to be the rule due to the nature of the specific re- 
action of the host and pathogen to each other. In order to take non- 
randomness into account we introduce a measure of departure from 


random association, @;; . The measure of departure from random 
association is a positive number satisfying the relation 
Gig = OijLiY; - (1.6) 


Kimura [1958] introduced this measure in connection with the study 
of nonrandom mating diploid populations. We note that the measure 
6,; is related to certain conditional probabilities. The conditional 
probability that a member p of the pathogen population belongs to 
race R,; given that it is associated with a member h of the host popula- 
tion belonging to variety V; is 


Pr (pe h; | he Vi) = gij3/Li = Yidi; - Gy) 


Similarly, the conditional probability that a member h of the host 
population belongs to variety V; given that it is associated with a 
member p of the pathogen population belonging to race R; 1s 


1246 (h 62 Va | Pp e R;) = 0:3/Ys = x93; 0 (1.8) 
Since x;6;; and y;0;; are conditional probabilities, it follows that 
ye D0 ye y:0;; = 1. (1.9) 


2. THE FITNESS FUNCTIONS 


Due to the specific nature of the reaction of the host and pathogen 
to each other, with each association <j of the host and pathogen we 
shall associate two fitness functions, one for the host and one for the 
pathogen. The fitness functions may be regarded as measures of the 
ability of the host and pathogen to reproduce in a given association. 
Let \,, be the fitness function of the host and y,; the fitness function 
of the pathogen in the zj-th association. Since the ability of the host 
and pathogen to reproduce in a given association may depend on the 
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x, and y; , we shall allow \;; and u;; to be real-valued functions of the 
real variables x; and y; . Clearly, for every ¢;; we associate a \,;; and 
u;; So that the pair (\,; , «;;) may be regarded as a random variable 
with respect to the ¢;; . 

The expected value of \;; or the mean fitness of the host population is 


HA) =. = Dy eisda , (2.1) 


and the expected value of u;; or the mean fitness of the pathogen popula- 
tion is 


E(u) Sa et De Psi : (2.2) 


The mean fitness of host variety V; , or the conditional expectation 
of \;; given V; , 1s 


EQ | Vj) == >» YiOurer 5 (2.3) 


paal 


and the mean fitness of race R; of the pathogen, or the conditional 
expectation of u;; given FR; , is 


E(u | R;) Sl a De Bs s5mss ° (2.4) 


The conditional expectations of X;; given R; and y;; given V,, 
which are also of interest, are given by 


EQ |R,;) =A; = SB LiOsihiz , 
aes (2.5) 
E(u | V,) ty Ute eas oS Yi OisMiz - 


We now make the following definitions of the measures of fitness 
of variety V; and race Rk; . The measure of fitness of variety V,; of the 
host population is defined by the differential equation, 


d(log H;)/di = »,. , (2.6) 
and the measure of fitness of R; of the pathogen population is defined by 
d(log P;)/dt = w.; - . (2.7) 


These definitions of the measures of fitness of variety V; and Rh, 
are equivalent to those given in the previous paper. 

From the definitions of the measure of fitness of variety V; and 
race R; , it may be shown that the change in the probabilities x; and y; 
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in time is characterized by the set of differential equations 
dx;/dt = ZN Aes © 
dy ;/dt = Yi(u.; a pes (j = 1b, 2s ,n). 


Finally, by differentiating H and P with respect to time, we may 
show that the changes in population number in the host and pathogen 
populations are given by 


d(log H)/dt = d.. , d(log P)/di = p.. . (2.9) 


Lip te , Mm), 


I 


(2.8) 


3. THE VARIATION AND COVARIATION IN FITNESS 
IN A HOST-PATHOGEN SYSTEM 


We continue our characterization of the host-pathogen system by 
defining certain variances and covariances in fitness. 
The total variance in fitness in the host population is 


var (A) = E(A — 2,.)? = 3. giildsi — A.) (3.1) 
the total variance in fitness in the pathogen population is 


var (u) = E(u — wu...” = Do viiluis — os (3.2) 


and the total covariance in fitness in the host-pathogen system is 
cov A, ») = HA — 2d.) — w.) = De: gis(Asz3 — V..)(uis — w..)- (8.3) 


In addition to the variance and covariance in fitness, we may also 
define certain components of variance and covariance which are useful 
in characterizing a host-pathogen system. 

The variance in fitness in the host population attributable to varieties 
is 


var (A; V) = DS eA ae Xs (3.4) 
the variance in fitness attributable to races is 
var (\;R) = Di yi. — A.) (3.5) 


and the variance in fitness in the host population attributable to the 
interaction of varieties and races is 


var (A; VR) = De 5;(Va; y Vi. se r.; ae Near (3.6) 
By continuing in the same way we may define analogous components 


of variance in fitness for the pathogen population. Thus we shall let 
var (wu; V), var (uv; R), and var (u; VR) stand for the components of 
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variance in fitness in the pathogen population attributable to varieties, 
races, and the interaction of varieties and races, respectively. Similarly, 
we shall let cov (A, »; V), cov (A, #; BR), and cov QA, #; VR) represent 
the components of covariance in fitness attributable to varieties, races, 
and the interaction of varieties and races respectively. These com- 
ponents of covariance are, of course, defined in the obvious way. 

In particular, if the host and pathogen are associated at random 
so that 6,; = 1 for all 7 and j, then the following relations among the 
total variance and covariance and the components of variance and 
covariance hold. 


var (x) = var (A; V) + var (A; 2) + var Qe VR); 
var (u) = var (u; V) + var (u; R) + var (u; VR), (3.7) 
cov (A, uw) = cov (A, w; V) + cov (A, uw; R) + cov (A, u; VR). 


Relations (3.7) will not hold in general, however, if the association 
of the host and pathogen is nonrandom. 


4. THE CHANGE OF ).. , ».. , VAR(A), VAR(x), 
AND COV(A,u) IN TIME 

We complete our characterization on the host-pathogen system by 
finding differential equations characterizing the change in A.. , #.., 
var (A), var (4), and cov (A, #) in time. The proofs of the results of 
this section are easily obtained by straight-forward differentiation of 
the functions in question with respect to time and by using the results 
and definitions of the preceding sections. 

When one wishes to find these differential equations, certain variables 
arise. A considerable simplification in representation may be gained 
¢ we set 6., = A(log 8,;)/dt, 6; = d(log gs;)/dt, dss = Adss/dl, and 
a;; = dy,;/dt. The relations 6;; = 9:;0:; and i; = ¢ii@ii are also 
useful. Note, with each ¢,; we may associate the four-tuple, (¢:; , 
6:3 , is » Bas), SO that the four-tuple may be regarded as a random 
variable with respect to ¢i; - 

With the above conventions, the change in mean fitness in the 


host population becomes 

dy__/dt = var (d; V) + cov (A, u; R) + BO EOD) 
and the change in mean fitness in the pathogen population becomes 
du../dt = var (u; R) + cov (A, #5 V) + E(u) + B@). (42) 


Thus, the change in the mean fitness in time in the host population 
partitions into a variance component in the host population attributable 
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to varieties, a component of covariance attributable to races, a term 
attributable to the changes in the measures of departure from random 
association, and a term attributable to changes in the fitness functions 
in time. The equation characterizing the change of mean fitness in 
time in the pathogen population is similar. If all 6;; , \;; , and ui; 
are constant, then equations (4.1) and (4.2) reduce to the results given 
in the previous paper. 

The differential equation characterizing the change in time of the 
total variance in fitness in the host population is 


d[var (A)]/dt = Elg( — d..)?] + 2 cov (A, d), (4.3) 


and the differential equation giving the change in time of the total 
variance in fitness in the pathogen population is 


d[var (u)|/di = Elg(u — u..)°] + 2 cov (u, 2). (4.4) 


Finally, the differential equation characterizing the change in the 
total covariance in fitness in time is 


d[cov (A, »)]/dt = E[gQ\ — ..)(u — w..)] + cov (A, ») 
+ cov (A, fi). (4.5) 
Jt will be noted that in equations (4.1) through (4.5) we have set 
EQ) = Leeuks, BO) = Teududs » 


z (4.6) 
EIGQ — YT = Li eubslsi — 2... 


E(6u), E(u), El(gu — w..)"], and E[gA — 2..)(u — w..)] are of course 
defined in a similar way. It is instructive to study the form of equations 
(4.3), (4.4), and (4.5) according as all 6;; , \;; and y;; are constant or 
nonconstant. 


5. STATIONARY STATE SYSTEMS 


We shail say a host-pathogen system is in a stationary state if all 
yi; cease to change with time. Our discussion of stationary state 
systems begins by considering the case when all 0;; , \;; , and y,; are 
constants. We note from equation (1.9) that, if all @;; are constant 
and the system is in a nonstationary state, then the 2; and y, must 
change in such a way that the equation 


SY, LiY;9;; — 1 (6,1) 


is satisfied. 
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Let A = [a;;] = [6;;\;;] and B = [b.;] = [::#:;], be constant m xX n 
matrices, and let &’ = (4,,--:,4,) and’ = (hi, °°: 9,) be vectors 
of stationary state probabilities. In addition, let X\.. and f.. be the 
values of \.. and u.. corresponding to the stationary state vectors 
Xand¥. Finally, let c; andc, represent m X 1 and n X 1 column vectors 
consisting of all 4..’s and g..’s respectively, ie. cf = (A.., °°" | i...) 
and cf = (f.., -°:*, f@..). Throughout the remainder of the paper a 
prime will be used to denote the transpose of a matrix or vector. 

From the definition of a stationary state system, it follows that 
four classes of stationary states may exist; namely, (1) when popula- 
tion number in both the host and pathogen populations is constant, (2) 
when population number in the host population is constant but that 
in the pathogen population is variable, (3) when population number 
in the host population is variable but that in the pathogen population 
is constant, and (4) when population number in both the host and 
pathogen population is variable. Henceforth we shall refer to the 
four classes of stationary state systems as systems of Class: I, IT, 111, 
and IV, respectively. * 

If a host-pathogen system belongs to Class I, for example, then the 
defining equations of the class are 


d(log H)/dt 
d(log P)/dt 


r.. = 0, 


(5.2) 


I 


y., = 0. 


Moreover, if the system is in a stationary state, then the set of 
differential equations 


dz,/dt = x:;, = 0 (Gj Sal OSB iO, (5.3) 
dy;/dt = yi.; = 9 G= fee Ne 


must be satisfied. We shall refer to equations of the form (5.2) as 
equations of a stationary state. And, if the stationary state is non- 
trivial, i.e. all x; and y; are not zero, then the equations 


u.; = 0 (ja HO 


must be satisfied. . 
By writing equations (5.4) out in full, we see that the stationary 


~ state vectors ¥ and ¥ must satisfy the algebraic equations, 
B sean Of yee AY 025 (5.5) 


where 0, and 0, are n X 1 and m X 1 zero vectors respectively. By 
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continuing in this way, we may find the defining equations, the equa- 
tions of a stationary state, and a set of equivalent algebraic equations of 
the three remaining stationary state systems. The results are given 
in Table 1. 

For the case of the matrices A and B nonconstant, we may again 
write down the defining equations of a class, the equations of a stationary 
state, and the algebraic equations of a stationary state. We shall, 
however, restrict our considerations to that class of fitness functions 
which are such that the algebraic equations of a stationary state admit 
at least one nontrivial solution. 


6. SOLUTIONS OF THE STATIONARY STATE EQUATIONS 


In this section we shall state some conditions under which solutions 
of the stationary state equations exist for the case A and B are constant 
matrices. For a detailed treatment of the methods used in this section 
the reader is referred to a book on matrix algebra such as Perlis [1952]. 
We shall say a stationary state is unique if, and only if, the equations 
of a stationary state admit a unique solution. The four classes of 
systems will be considered in order. 

The algebraic equations for a stationary state system of Class I are 


B’x = 0, ) (6.1) 
Ay = 02. (6.2) 


Note that (6.1) is a set of m linear equations in m unknowns and 
(6.2) is a set of m linear equations in n unknowns. Let min (m, n) be 
the minimum of m and n. If the rank of B is 7, and that of A is r, ; 
then neither 7; nor r, can exceed min (m, n). Moreover, equations 
(6.1) and (6.2) admit a nontrivial solution if, and only if, the relations 
r, <mandr, < n are satisfied. 

Let 8, be the set of all ¥ satisfying equation (6.1) and let 8, be the 
set of all ¥ satisfying equation (6.2). The sets 8, and 8, are vector 
spaces. Thus, any multiple of a vector or linear combination of vectors 
belonging to the set is again a member of the set. 

If the rank of B is r, < m, then equation (6.1) may be reduced by 
elementary row operations to the form 


Dsa.B; 
! a 
Stns das x= 0: ) (6.3) 


where I,, is an identity matrix of order r, , B, isar, X (m — r,) matrix 
of constants, and 0, isan — r, X m zero matrix. The row vectors of 
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the (m — r,) X m matrix 
Bry =e) (6.4) 
are solutions of equation 6.1 and form a basis for the vector space 8, . 


Similarly, if the rank of A is ro < n, then (6.2) may be reduced to 
the form 


Ti Uealeeso: (6.5) 


where I,, is an identity matrix of order r, , A, isa 72 X (n — re) matrix 
of constants, and 0, is a (m — re) X n zero matrix. The row vectors 
of the (n — rz) X n matrix 


(Al i.—I,_,,) (6.6) 


are solutions of equations (6.2) and form a basis for the vector space §, . 

Any pair of vectors (X, ¥) where X belongs to S, and ¥ belongs to 8, 
with coordinates satisfying the conditions, 0 < ¢; < 1,0 < g; <1, 
>: 4; = 1, and >); 9; = 1, is a solution of equations (6.1) and (6.2) 
and are, therefore, probability vectors of a stationary state. Clearly, 
the pair (&, ¥) is not unique so that there exists no unique stationary 
state for systems of Class I. 


For systems of Class II the algebraic equation of a stationary 
state are 


B’x = Co ) (6.7) 
Ar 0s. (6.8) 


Equation (6.8) may be solved by the methods discussed in Class I 
systems but (6.7) needs special consideration. Writing (6.7) component- 
wise we have 


»S b;,2; = > Lip. GHA, ee: 5), (6.9) 
where 
i = a3; b3G; - 
j= 
For each probability vector ¥ which is a solution of (6.8), we may 


find a @;, which is independent of %, and for each ;. corresponding 
to a y we have the system of homogeneous equations 


Dy (bie — ees = 0 G1, ++, 2). (6.10) 
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It is clear that the methods discussed in Class I systems may again 
be applied to find solutions of equations (6.10). We note that there 
exists no unique stationary state for Class II systems since there is no 
unique solution of equations (6.7) and (6.8). It is clear that the methods 
used to find stationary state solutions for Class II systems may again 
be applied to Class IIT systems. 

For systems of class IV the algebraic equations of a stationary 
state are 


BX = c , (6.11) 
AY =c,. (6-12) 


Equations (6.11) and (6.12) cannot be solved by the methods used 
heretofore since they are nonlinear in the components of the unknown 
vectors and ¥. If at least one of the vectors c, and c, is constant, 
then we may use the methods of solving linear equations to find solu- 
tions of (6.11) and (6.12). 

For example, if the vector c, is constant, then equation (6.11) admits 
a solution if, and only if, the rank of the matrix B’, satisfies the relation 
r, < mand the rank of the augmented matrix (B’, Cs) 1s. li ean, 
then (6.11) admits infinitely many solutions. If r,; = m, then (6.11) 
admits a unique solution which may be found by Cramer’s rule. In 
all cases we may place conditions on the elements of the matrix B’ 
so that at least one probability vector is a solution of (6.11). 

For each probability vector which is a solution of (6.11) it may 
be shown by the methods used in the discussion of Class II systems 
that (6.12) becomes a homogeneous equation in ¥. In general, homo- 
geneous equations do not admit unique nontrivial solutions so that a 
unique stationary state cannot exist in this case. The cases in which 
c, is constant and both c, and c, are constant may be treated similarly. 

It is of interest to note that if (1) the vectors % , --- , %, are solu- 
tions of (6.11), (2) the vectors f, , --- , ¥. , are solutions of (6.12), and 
and (3) a; (« = 1, --- , k) are positive numbers which satisfy the con- 
dition, a, + --: +a, = 1, thena,% +--+ + a,x, and af; + +++ + aS 
are also solutions of (6.11) and (6.12) respectively. 

The case in which m = n, A and B are nonsingular, and the host 
and pathogen are associated at random is covered in the following 
interesting theorem: 


Theorem 6.1: If (1) m = n, (2) the matrices A and B are non- 
singular, and (8) all @;; = 1, then x, y, \.., and f.. are unique so that 
a unique stationary state may exist in systems of Class IV. 
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Proof: Let d,, and fi.. be the values of \.. and w.. corresponding to 
the stationary state vectors and y. If X and ¥ are stationary- 
state vectors, then their components, by Cramer’s rule, must 
satisfy the equations. 


Gps Bayes (a 
§, =4..| A; |/| Al Qj 


where B; is matrix obtained from B by replacing the 7-th row by a 
row of ones, A; is a matrix obtained from A by replacing the j-th column 
by a column of ones, and | B; |, | A; |, |B |, and | A | are the determinants 
of the matrices in question. Summing equations (6.13) over 7 and j 
and using the condition the components of and ¥ sum to one yields 


ee (6.13) 


Loe , mM), 


i= (6.14) 
a. = (BIS 1B. 
By substituting \,, and @.. in (6.13) we find 
hes [Bal Bas 

as (6.15) 


$= 1A WIA 


To prove uniqueness, let \* and w* be the values of d.. and u.. 
corresponding to the stationary state vectors x* and y*. Proceeding 
as before, we have by Cramer’s rule 


Ax | A; I/] A]. 


I 


Hiss 


(6.16) 


I 


* 
Y; 


By summing over 7 and j and using the condition the components of 
x* and y* sum to one we findd},. = A\* and... = u*, and by substituting 
\.. and @.. in (6.16) we reach the conclusion that = x* and f = y* 
which proves the theorem. 

It will be noted that in order that % and § be probability vectors 
we must require that the elements of the matrices A and B be such that 
the components of X and ¥ be non-negative. It will also be noted that 
by dropping the assumptions of equal numbers of varieties and races 
and random association of the host and pathogen, we are led to the 
possible existence of non-unique stationary states. 
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7. THE STABILITY OF A STATIONARY STATE 

We next turn to the question of stability of a stationary state 
corresponding to a pair of stationary state vectors (%, 9). For brevity, 
we will simply refer to the stability of a pair (%, j). 

Let z’ = (x’, y’) beal X (m + n) vector and rename the components 
z(k =1,---,m+n). Letdz /dt be a column vector with components 
dz,/dt, and let u(z) be a column vector with components u,(Z) = 
FAC eae NG, for 3-1; 9% and u.(Z) = Ye(ue — w..) for 
>= m+1,---,m+n. with these definitions, differential equations 
(2.8) may be written in the compact form 


dz/dt = u(Z). (7.1) 
In this paper the definition of stability given by Bellman [1953, 


p. 76] will be used. Using this definition, the stability of a stationary 
vector 2 may be decided by using the following procedure. Let 


hile et Se = yg eg Nata) Now by the multivariate version of 
Taylor’s theorem, we may express u(z) in the form 
u(z) = u(z) + Qh + vib), (7:2) 


where Q is the (m + n) X (m + n) Jacobian matrix of u(z) evaluated 
at z and v(h) stands for a vector of nonlinear terms. Clearly, 
dh/dt = dz/dt since zisa constant vector. Moreover, if Zisa stationary 
state vector, then u(Z) equals the zero vector. The question of the 
stability of vector Z is thus reduced to question of the stability of the 
trivial solution h’ = (0, --- , 0) of the differential equation 


dh/dt = Qh + v(h). (7.3) 


We shall obtain conditions for stability under the assumption 
-differential equation (7.3) satisfies the hypotheses of Theorem 1, page 
79 of Bellman [1953]. Before this theorem may be used, however, 
the following observations are essential. From the definition of the 
h, it is easy to see its components satisfy the conditions 


mtn 


yh =0 and Dd, & = 0. (7.4) 
k=1 k=m+1 

Therefore, any meaningful solution of differential equation (7.3) must 
also satisfy conditions (7.4). It may be easily shown, although we shall 
not do so here, that, if the initial conditions satisfy conditions (7.4), 
then all solutions of (7.3) satisfy conditions (7.4). This result permits 
us to work directly with the J acobian matrix Q. 

One of the hypothesis of the stability theorem (Theorem deeerce, 
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Bellman) is that all solutions of the differential equation 
dh/dt = Qh, (7.5) 


approach zero as¢-> ©. All solutions of (7.5) approach zero ast — © 
if, and only if, the real parts of the characteristic roots of Q are nega- 
tive. (Theorem 7, p. 25, Bellman.) The stability of a vector Z is, 
therefore, determined by the properties of the matrix Q. 

Let us next examine the structure of the matrix Q. The matrix Q 
is such that it has the following form 


i Q. | @) 
(oe eee (7.6) 


and the submatrices Q, , Q; , Qs , and Q, have the following forms: 
Q, isa m X m matrix with diagonal elements 


ee DTG ie 
Obi = (2 an.) (= 1, , Mm); ap 
and nondiagonal elements 
at, | Cuan yer eae 
Ghia = 4,( Be aa GsAmy). (7.8) 
Q, isa m X n matrix whose j7-th element is 
A ae. Any. oe sh het 
Cy = 2,(M 2...) Piel eee) (72) 
Q; isan X m matrix whose j7-th element is 
, Ou.g iu.) 
G35 = 9,2 . Fae (7.10) 
Q, isan X n matrix with diagonal elements 
, Oui tu.) , 
aii = Yi = Oy; ) (7 .11) 
and nondiagonal elements 
A Ou. 7 du.) 
G45" = 9,(2H area i>) 


Recall that in the above expressions the carats stand for the evalua- 
tion of the elements of the Jacobian matrix of u(z) at a stationary state 
vector Z’ = (%', 7’). 

The following equations are essential for the determination of the 
properties of the matrix Q. 
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OA. = - ON;; P Od.. On;. 

OA, On; On ae ONG 

Sa Se Uag Nas SS rene: Se ey | ea i 
OY: Ty OY; ay; »D ai (7.14) 


Expressions for du.,/dv; , Ou../dx; , O4.;/OY; , and du../dy; are 
similar to (7.13) and (7.14) and may be written down from symmetry 
considerations. 

It is of interest to examine stability in the following five cases. 
The truth of the statements to follow can easily be deduced from equa- 
tions (7.7) through (7.14). 

1. If a system belongs to Class I and all dj; and u;; are constant, 
then the Jacobian matrix Q has the form 


Dati (7.15) 


where 0 stands for a square zero matrix. Now from the theory of 
characteristic roots it is known that the sum of the elements on the 
principal diagonal equals the sum of the characteristics roots. In 
this case the sum of the elements on the principal diagonal is zero. 
hence, the sum of the characteristics roots is zero. It follows that all real 
parts of the characteristics roots cannot have the same sign and, there- 
fore, cannot be negative. Thus, no stable stationary states can exist 
in this case. Note the role constant population numbers in the host 
and pathogen populations and constant fitness functions play in the 
instability of this case. 

2. If a system belongs to Class II and all \;; and p;; are constant, 
then the matrix Q has the form 


Si ood ; (7.16) 
Q, 1 Qs 
3. If a system belongs to Class III and all \;; and y,; are constant, 
then the matrix Q has the form 


She aie Car) 


4, If the system belongs to Class IV and all \,; and y,; are constant, 
then the matrix Q will have no submatrices which are necessarily a 
zero matrix. 
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5. If all \,; and pw,; are nonconstant, then the matrix Q will not 
have a submatrix which is necessarily a zero matrix. This statement 
is true of systems of Class I, II, II and IV. 

The following theorem supplies a sufficient condition for stability 
in cases 2, 3, 4 and 5. 


Theorem 7.1: Construct a matrix Q* from Q as follows. Let 
qt = Ga (k = 1, +--+ ,m +n) and set g&, = 3(Qux + Gx) fork + Kk’. 
If Q* is negative definite, then the real parts of the characteristics 
roots of Q are negative. 


Proof: Let be a complex characteristic root of Q and let 8, + 76. 
(77 = — 1) be its associated complex characteristic vector of 
dimensions (m + n) X 1. From the definition of characteristic 
roots and vector we have 


Q(6: = 782) a (8, as Bo)d. (7.18) 


Now multiply equation (7.18) by (8; — iG.)’, the transpose of the 
complex conjugate of 6, + i$,. The result is 


d = (818: + 8262) '(BiQ6:; + B.QB2 + 1(B1QB. — B:Q6:)). (7.19) 


Thus since 6/8, + 6462 1s positive the sign of the real part of a character- 
istic root depends on the sign of 6/Q6; + 6/Q8.. But 8/Q6, + B/Q8, = 
B/Q*6, + B{Q*B.. It follows that if the matrix Q* is negative definite, 
then the real parts of all characteristic roots will be negative, which 
completes the proof of the theorem. 

In cases 2, 3, 4 and 5 it is possible to construct a matrix Q* from Q 
so that Q* is negative definite. Therefore, stable stationary states may 
exist in cases 2, 3, 4 and 5. 


8. INTERPRETATIONS AND PRACTICAL CONSIDERATIONS. 


A little consideration will lead to the conclusion that the class of 
stationary state system to which a given host-pathogen system may 
belong depends on the length of the time interval and the size of the 
geographical area under consideration. For example, suppose our 
host-pathogen system is a given cereal crop and some species of rust. 
If we consider this system with respect to the time interval, 0 < ¢ < We 
representing a single growing season and a single field of the crop, 
then any stationary state system would probably belong to Class LT: 
since in a given growing season the number of members of the host 
population is essentially constant but the number of members of the 
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pathogen population would probably be increasing. On the other- 
hand, if the time interval is taken to be some period of years and a 
larger geographical area were considered, then our host-pathogen 
system would probably belong to Class IV; since during this period 
of years the number of members of both the host and pathogen popula- 
tions would probably be changing. It seems likely that the majority 
of host-pathogen systems encountered in practice belong to systems 
of Class IV. 

A question that arises is what procedure should one follow if he 
wishes to construct a host-pathogen system so that (1) it is in a stable 
stationary state, and (2) the number of members in the pathogen 
population at the end of any growing season is less some critical number 
c. In general, the parameters of the system can never be known exactly. 
At the present time, therefore, the most fruitful approach seems to 
be an experimental one. That is, simply construct a set of systems 
consisting of a mixture of varieties and races and observe their be- 
havior over a period of time. Any system satisfying the above condi- 
tions in this time interval would, apparently, be satisfactory from the 
practical point of view. 

Some suggestions for the construction of a host-pathogen systems 
meeting conditions (1) and (2) of the above paragraph for the case of 
equal numbers of varieties and races, constant fitness functions, and 
random association of the host and pathogen are given in the previous 
paper, Mode [1960]. 


9. SUMMARY 


The results of this paper represent a generalization of results given 
in a previous paper. In this paper a characterization of a host-pathogen 
system containing an arbitrary number of host varieties and races of 
the pathogen was given under the assumptions of nonrandom associa- 
tion of the host and pathogen and nonconstant fitness functions. 

Four classes of stationary state systems were defined on the basis 
of constant or nonconstant population numbers in the host and pathogen 
populations. Some methods of finding probability vectors of a station- 
ary state were given for the four classes of systems. 

The four classes of stationary state systems were also checked 
for stability. It was found that if the fitness functions are constant, 
then a stable stationary state may exist in systems of Class II, III, 
or IV but not in systems of Class I. If the fitness functions are not 
constant, then it is possible for a stable stationary state to exist in 
any of the four classes of systems. The possible existence of stable 
stationary states is of considerable practical interest. 
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LATIN SQUARES TO BALANCE IMMEDIATE RESIDUAL, 
AND OTHER ORDER, EFFECTS. 


Paut R. SHEEHE AND IrRwIN D. J. Bross 


Roswell Park Memorial Institute 
Buffalo, New York, U.S.A. 


1. INTRODUCTION 


While the Latin Square designs to be presented have a broad field 
of application, the focus of our discussion will be on their use in clinical 
trials. These extra-balanced designs were, in fact, developed specifically 
to meet an experimental problem that arose in setting up an analgesic 
trial. This specific problem will be considered briefly in order to illus- 
trate the scientific (as distinguished from purely mathematical) rationale 
for doing balanced experiments. 

The original study plan for the analgesic trial called for five agents 
to be used: a placebo, a standard drug, a new agent, a combination of 
the new and standard agents, and a course of hypnosis. Each patient 
was to be his own control, but a decision had to be made about the 
duration of each treatment. If the duration of each treatment were 
as long as a week, drop-outs, with resulting incomplete sequences of 
treatments, would become a serious practical problem. In view of 
this, the principal investigator considered a shorter trial period for 
each agent, such as three days, to be preferable. He was confident 
that this period of observation would be long enough to elicit reliable 
responses and, so far as straight chemical carry-over, or residual, effects 
were concerned, analgesic effects would last only a few hours. But 
then he recalled a kind of psychological carry-over effect that he had 
noticed in previous studies. When an effective agent was given after 
a placebo or ineffective agent, it seemed that the effective agent often 
failed. A plausible hypothesis was that the patient had lost confidence 
in the analgesics and it would take some time for his confidence to be 
restored. This applied especially to double-blind studies in which 
the patient often was under the impression that he received the same 
agent all of the time. At this point in the planning, the investigator 
therefore wondered if there was some way to insure that each agent 
tested would be immediately preceded by the placebo an equal number 
of times. Also, since it was possible that several treatments would be, 
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like placebos, relatively ineffective, he wondered if there was a way to 
achieve more complete balance for immediate residual effects by insuring 
that every treatment would be immediately preceded equally often 
by every other treatment. 

The design which achieved this objective was worked out by trial 
and error, but then the study plan was modified to include two new 
treatments. This change emphasized the desirability of having a 
eeneral procedure for constructing predecessor-balanced designs. In 
the literature, E. J. Williams [1] originally set down, in 1949, the con- 
ditions sufficient to produce Latin Squares balanced for immediate 
predecessors. Sufficient conditions for more remote predecessors, 
particularly the next-to-last predecessors, were also specified. Williams 
also presented analysis of variance procedures to accompany the designs. 
Raymond, et al, [2] used this type of design and analysis in 1957 for 
a study of tranquilizing drugs in psychoneurosis. In 1952, H. D. 
Patterson [3] considered the more general problem of predecessor- 
balanced designs, not only for square arrangements such as Williams’, 
but also for certain incomplete block arrangements. 

In 1958, J. V. Bradley [4] presented an easily remembered con- 
struction which meets the conditions set down by Williams when the 
number of treatments is even. He also presented additional balancing 
procedures which might be useful in special cases. To round out the 
picture, we shall present here an easily remembered procedure which 
can be used whether the number of treatments is odd or even. In 
addition to the Latin Square and immediate predecessor-balance 
features (properties 1 and 3, resp., to follow), another balance property 
(property 2, to follow) will be noted. We shall discuss the application 
of this, and other extra-balanced designs, in the context of the clinical 
trial. In the appendix, we shall present a detailed proof of the method. 
A second section of the appendix will be devoted to describing a simple 
variation in construction which produces Graeco-Latin Squares when 
the number of treatments is odd. 


2. PROCEDURE 


Before proceeding further, a more formal definition of ‘balance’ is 
appropriate. In addition, some predecessor balance properties, other 
than immediate, will be defined. A design is called balanced with 
respect to the set of immediate predecessors if every treatment is 
immediately preceded equally often by every other treatment. Simi- 
larly, a design may be called balanced with respect to the set of prede- 
cessors of any specified degree of remoteness (e.g., the second degree, 
which involves the next-to-last predecessors), if every treatment is 
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preceded equally often by every other treatment at that degree of 
remoteness. Balance with respect to the set of all predecessors (without 
regard to degree of remoteness) is achieved if every treatment is pre- 
ceded, immediately or more remotely, equally often by every other 
treatment. The term, complete balance, is reserved for the case when 
the set of predecessors (of a specified or unspecified degree of remoteness) 
is the same for every treatment. Thus, balanced designs are not com- 
pletely balanced only because treatments do not precede themselves. 
The design presented here is (1) completely balanced with respect to 
the number of treatments preceding every treatment, (2) balanced 
with respect to the set of all predecessors, and (3) balanced with respect 
to the set of immediate predecessors. 
The procedure for construction is as follows: 


(a) Number the treatments, 2 = 1, --- ,n. 
_ (b) Start with a cyclic n X n Latin Square, i.e. one in which the 
sequence of treatments in the 7th row is7,7-+1,---,n,1,2,---,27— 1. 


(c) Interlace each row of the cyclic Latin Square with its own re- 
verse order sequence, i.e. with its mirror image. For example, if n = 5, 
the first row of the cyclic Latin Square reads 1, 2, 3, 4, 5. Its mirror 
image is 5, 4, 3, 2, 1, and when this is interlaced with the first row of 
the original square, the interlaced sequence reads 1, 5, 2, 4, 3, 3, 4, 2, 5, 1. 

(d) Slice the resulting n X 2n figure down the middle, thus forming 
two n X n Latin Squares. The columns of each square refer to the 
order of presentation, from left to right, and the rows refer to individuals. 
Treatments appear in the body of each square. 

It will be found that, when 7 is even, each of the constructed squares 
has the three desired properties. In this case, either of the two squares 
may be used. When n is odd, each of the constructed squares has the 
first property, but not the last two. However, when the two squares 
are considered as a whole, the last two properties are indeed present. 
Consequently, in this case, both of the constructed squares must be 
used. Constructed squares for n = 4 and n = 5 are presented in 
Tables 1 and 2. The reader may verify, by inspection, that the stated 
properties are present in either case. It will also be noted that, for 
n = 4 or for any even number, in general, the left square is identical 
with that originally presented by Bradley. Proof that all these proper- 
ties hold in general will be offered in the appendix. 


3. DISCUSSION 


Lest the reader be left with the impression that, because the Latin 
Square principle has been used to achieve a desired order balance, a 
Latin Square analysis is advised, we specifically disclaim this as our 


408 BIOMETRICS, SEPTEMBER 1961 


TABLE 1 
ConsTRucTED LATIN SQUARES FOR 2 = 4 


Left Square Right Square 
Order of Presentation Order of Presentation 
ih we Riz ie paar) eh zy 
Individual Individual 

“IN iy 4h 8 i 3.) 32) 45a 

B 2 1 3 4 F A Ss 1 2 

C Stee eae el G 1 ay PA} 
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intention. In the context of the clinical trial where the problem of 
predecessor balance arose, the primary function of the balancing was 
not to reduce experimental error, but to provide a safeguard against a 
fairly specific danger. This safeguard is present in the designs whether 
we regard them as Latin Squares, or whether we regard individuals as 
blocks. Also, the main reason for using the patient as his own control 
in an analgesic trial is that the response variable is in subjective scale, 
dependent on the value judgment of the individual patient. Thus, if 
a fairly large number of individuals were available for the trial, a 
plausible method of analysis might be to analyze intra-patient compari- 
sons. In fact, the nature of the response variable is not very different 
from that encountered in paired comparison trials. Furthermore, 
the designs here are such that, if each treatment is paired with its 
immediate predecessor, all possible pairs of different treatments will 
be formed equally often. This suggests that an adaptation of the paired 


TABLE 2 
ConstRUCTED LATIN SQUARES FOR n = 5 


Left Square Right Square 
Order of Presentation Order of Presentation 
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Individual Individual 
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comparison analysis presented by Scheffé [5] might be satisfactory for 
practical purposes. Cochran and Cox [6] give an illustrative Latin 
Square analysis (in the manner of Williams), complicated because of 
balance for immediate residual effects, but they too caution against 
the indiscriminate use of the method when the underlying assumptions 
are suspect. 

In some clinical situations, neither a Latin Square nor a one-way 
analysis of all the data would be advisable. For example, in a post- 
operative trial of analgesics, Meier et al. [7] report that observations 
made in the first few post-operative hours, when patient pain status 
was most acute, were much more discriminatory for the efficacy of 
the agents than were observations made thereafter. If, as was actually 
done in the cited case, the later observations were dropped from the 
analysis, the remaining data would still be at least partially balanced. 
Thus, regardless of what analysis might be most appropriate, the 
predecessor balance feature serves as a kind of ‘ace-in-the hole’. If, 
when the study is subsequently reported, the critic should raise the 
question of psychological or other carry-over effects, the investigator 
could reply by pointing to the design precaution that had been taken 
to balance out such effects. 

It is most important, however, to remember that predecessor balanc- 
ing cannot provide perfect protection against carry-over effects. Such 
effects are not likely to be consistent, that is, they may vary considerably 
from one patient to another. To control the carry-over effects, the 
balance would have to be over those patients who exhibited consistent 
carry-over effects, but unfortunately these sub-sets of patients cannot 
be distinguished in advance. Nevertheless the control should be closer 
where there is balance over a single experiment than with the usual 
Latin Square where there would be balance over a hypothetical large 
series of experiments. This more modest justification is sufficient for 
practical purposes. We should not expect any design feature to com- 
pletely solve a deep-rooted experimental problem—all that can reason- 
ably be asked is that the device improve the degree of control. This 
point deserves attention because many useful design features have 
been “over-sold” so that the investigator gets the impression that he is 
completely covered by his statistical insurance policy. 

What are some other limitations and drawbacks of these designs? 
One limitation is that the designs are not balanced with respect to 
second-degree (and higher-degree) predecessors. A more practical 
limitation (for designs with an odd number of treatments) is that the 
number of patients must be a multiple of 2n in order that balance be 
achieved. For example, with 7 treatments there might be good practical 


410 BIOMETRICS, SEPTEMBER 1961 


reasons for terminating the experiment after 35 patients or so have 
come in, but 42 patients would be required to balance the design. This 
limitation is less important, however, when the emphasis is on pre- 
cautions rather than analysis. 

Another limitation is that complete balance (as defined) with respect 
to immediate predecessors is not achieved. But by slightly modifying 
the designs given here, complete balance can be achieved by simply 
inserting a “zeroth order” column, identical with the first column, in 
each square. If the data from the zeroth order column are discarded 
in the analysis, it will be found that each treatment is immediately 
preceded equally often by every treatment, including itself. Thus the 
set of immediate predecessors is the same for every treatment. Bradley 
has already pointed this out in his discussion of an even number of 
treatments. It is equally true for an odd number of treatments. But 
the advantages of this further balance would have to be weighed against 
the possible disadvantages. For example, in the post-operative pain 
situation reported by Meier et al. and cited above, it would be especially 
undesirable to discard initial post-operative observations. 

Another simple extra-balancing feature, mentioned by Bradley for 
an even number of treatments but equally applicable for an odd number, 
provides column balance analogous to the row order balance. The 
procedure is to permute the rows of each Latin Square in such a way 
that the first column reads down in the same sequence as the first row 
reads across. Note, however, that the addition of this feature sharply 
limits the number of possible Latin Squares. From a theoretical stand- 
point (e.g. the randomization set justification of the Latin Square 
analysis) the restriction raises some difficulties. Incidentally this same 
objection applies, though usually with less force, to any other extra- 
balanced design. Sir R. A. Fisher has vigorously maintained the 
position that the analysis of data must take into account the restrictions 
of the design and this view has been accepted by a majority of statis- 
ticians. This point was debated by Student (W. G. Gosset) and Fisher 
in the 1920’s. The discussion of the Knut Vic (Knight’s Move) Latin 
Square is directly relevant here since this square contains an extra- 
balance feature. In practice this would mean either a least squares 
analysis or the segregation of individual degrees of freedom associated 
with the restriction. If the Fisherian view is accepted, then there is a 
double liability to extra balance. Not only the computational difficulty 
would be increased but also results would be strongly dependent upon 
the assumptions of the model (loss of robustness). So we would not 
advise experimentors to use extra-balanced designs simply as a gimmick 


(or an “aesthetic” grounds). There should be some sound practical 
reasons for the additional restrictions. 
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APPENDIX 
Proof 


The basis for proving that the three properties listed in Section 2 
hold in general is the fact that the left square and the right square are 
mirror images of each other, i.e. treatments appear symmetrically about 
the vertical line at which the slice was made in step (d). There is this 
symmetry by virtue of the method of construction: the first treatment 
in any row of the left square appears as the last treatment in the corre- 
sponding row of the right square, the second in the left appears as the 
second-last in the right and so on to the last in the left which appears 
as the first in the right square. 

To prove that each of the constructed squares has the first property 
(complete balance with respect to number of predecessors), whether n 
is even or odd, it is sufficient to show that each square is, as claimed, 
a Latin Square. For every Latin Square has this property. By con- 
struction, each treatment appears exactly twice in every row of the 
n X 2n figure constructed in step (c). By symmetry, every treatment 
appearing in the left square also appears in the right square, hence each 
treatment must appear just once in any row of the left square and once 
in the corresponding row of the right square. Furthermore, the columns 
of the original cyclic Latin Square remain intact throughout the con- 
struction of each square. Thus each constructed square is arrived at, 
in effect, by permutation of the columns of a cyclic Latin Square. Since 
any permutation of columns of a Latin Square is also a Latin Square, 
both left and right figure are Latin Squares. This demonstrates that 
the first property holds. 

It can now be proved that the second property (balance with re- 
spect to the set of all predecessors) holds for the two squares taken as a 
whole. As shown, the two squares are Latin Squares and are mirror 
images of each other. Consequently, if k different treatments precede 
(and therefore the remaining (n — k — 1) treatments succeed) a given 
treatment in a row of the left square then the remaining (n — k — 1) 
treatments precede (and the k succeed) the given treatment in the 
mirror image row of the right square. That is to say, in any given row 
of the two squares taken together, every treatment is preceded (and 
succeeded) just once by each of the k + (n — k — 1) = (n — 1) other 
treatments. Taking all n rows into account, every treatment is preceded 
(and succeeded) n times by each of the (n — 1) other treatments. 
This constitutes proof that the second property holds when the two 
constructed squares are taken as a whole. Proof that, when n is even, 
each constructed square has this property will be put off until the third 
property has been dealt with. 
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reasons for terminating the experiment after 35 patients or so have 
come in, but 42 patients would be required to balance the design. This 
limitation is less important, however, when the emphasis is on pre- 
cautions rather than analysis. 

Another limitation is that complete balance (as defined) with respect 
to immediate predecessors is not achieved. But by slightly modifying 
the designs given here, complete balance can be achieved by simply 
inserting a “zeroth order” column, identical with the first column, in 
each square. If the data from the zeroth order column are discarded 
in the analysis, it will be found that each treatment is immediately 
preceded equally often by every treatment, including itself. Thus the 
set of immediate predecessors is the same for every treatment. Bradley 
has already pointed this out in his discussion of an even number of 
treatments. It is equally true for an odd number of treatments. But 
the advantages of this further balance would have to be weighed against 
the possible disadvantages. For example, in the post-operative pain 
situation reported by Meier et al. and cited above, it would be especially 
undesirable to discard initial post-operative observations. 

Another simple extra-balancing feature, mentioned by Bradley for 
an even number of treatments but equally applicable for an odd number, 
provides column balance analogous to the row order balance. The 
procedure is to permute the rows of each Latin Square in such a way 
that the first column reads down in the same sequence as the first row 
reads across. Note, however, that the addition of this feature sharply 
limits the number of possible Latin Squares. From a theoretical stand: 
point (e.g. the randomization set justification of the Latin Square 
analysis) the restriction raises some difficulties. Incidentally this same 
objection applies, though usually with less force, to any other extra 
balanced design. Sir R. A. Fisher has vigorously maintained th 
position that the analysis of data must take into account the restriction 
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Square is directly relevant here since this square contains an extra 
balance feature. In practice this would mean either a least square 
anulysis or the segregation of individual degrees of freedom associate 
with the restriction. If the Fisherian view is accepted, then there is 
double hability to extra balance. Not only the computational difficult 
would be increased but also results would be strongly dependent upo 
the assumptions of the model (loss of robustness). So we would n¢ 
advise experimentors to use extra-balanced designs simply as a gimmic 
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APPENDIX 
Proof 


The basis for proving that the three properties listed in Section 2 
hold in general is the fact that the left square and the right square are 
mirror images of each other, i.e. treatments appear symmetrically about 
the vertical line at which the slice was made in step (d). There is this 
symmetry by virtue of the method of construction: the first treatment 
in any row of the left square appears as the last treatment in the corre- 
sponding row of the right square, the second in the left appears as the 
second-last in the right and so on to the last in the left which appears 
as the first in the right square. 

To prove that each of the constructed squares has the first property 
(complete balance with respect to number of predecessors), whether n 
is even or odd, it is sufficient to show that each square is, as claimed, 
a Latin Square. For every Latin Square has this property. By con- 
struction, each treatment appears exactly twice in every row of the 
n X 2n figure constructed in step (c). By symmetry, every treatment 
appearing in the left square also appears in the right square, hence each 
treatment must appear just once in any row of the left square and once 
in the corresponding row of the right square. Furthermore, the columns 
of the original cyclic Latin Square remain intact throughout the con- 
struction of each square. Thus each constructed square is arrived at, 
in effect, by permutation of the columns of a cyclic Latin Square. Since 
any permutation of columns of a Latin Square is also a Latin Square, 
both left and right figure are Latin Squares. This demonstrates that 
the first property holds. 

Ié can now be proved that the second property (balance with re- 
spect to the set of all predecessors) holds for the two squares taken as a 
whole. As shown, the two squares are Latin Squares and are mirror 
images of each other. Consequently, if k different treatments precede 
(and therefore the remaining (7 — k — 1) treatments succeed) a given 
treatment in a row of the left square then the remaining (n — k — 1) 
treatments precede (and the k succeed) the given treatment in the 
mirror image row of the right square. That is to say, in any given row 
of the two squares taken together, every treatment is preceded (and 
succeeded) just once by each of the k + (n —k— 1) = (n— 1) other 
treatments. Taking all n rows into account, every treatment is preceded 
(and succeeded) n times by each of the (n — 1) other treatments. 
This constitutes proof that the second property holds when the two 
constructed squares are taken as a whole. Proof that, when 7 is even, 
each constructed square has this property will be put off until the third 
property has been dealt with. 
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In order to facilitate the proof of the third property, we, like Bradley, 
adopt the concept of ‘separation’. Let i be any treatment in a con- 
structed Latin Square, and let j be its immediate predecessor for a 
given individual. Then let 7 — j be the difference between the two 
treatments. If the difference is positive, the ‘separation’ is equal to that 
difference. If the difference is negative, the ‘separation’ is equal to n 
plus the difference. This concept of separation can be most easily 
visualized by placing the numbered treatments at equal intervals 
around a circle in clockwise sequence from 1 to n. The separation 
between 7 and j ~ 7 is the number of intervals passed in moving clock- 
wise from j to 7. Note that the separation between any 2 and every 
other possible preceding treatment, j # 2, runs from 1 ton — 1 in 
one-to-one correspondence with all treatments other than 7. Thus, 
a design in which, for every treatment, the separations, 1 through 
(n — 1), appear with equal frequency, is balanced with respect to 
immediate predecessors. 

Now, the cyclic nature of columns has been undisturbed in the 
construction of the two squares. Consequently, the sequence of separa- 
tions between successive treatments is the same for every individual 
row in a given square. We may therefore confine our attention to the 
sequence of separations in the first row, since the same sequence will 
apply to every row. 

For n odd, treatments in the first row of the left square appear in the 


sequence, 1, n,2,n —1,--- , (n+ 8)/2, (n+ 1)/2. Then the sequence 
of differences is alternating in sign, +(m — 1), —(m — 2), 
+(n — 3), ++: , —(1). The sequence in the right square is reversed, 


with changed signs, +(1), —(2), +(8), --- , —(m — 1). That is to 
say, a full set of positive integers from 1 ton — 1 and a full set of nega- 
tive integers from —1 to —(n — 1) appear as differences in the first 
row of the two squares taken together. When the negative integers 
are replaced by the corresponding separations, it is seen that in the 
first (or any) row of the two squares, each separation from 1 to (n — 1) 
appears exactly twice. Since this is true for every row, and since each 
treatment appears once in each column, every separation from 1 to 
(n — 1) occurs exactly twice for each treatment. This establishes that 
the third property holds for n odd. 

For n even, the sequence of differences in the left square is, +-(n — 1), 
—(n — 2), +(m — 3), --- , —(2), + (1). Again, this alternating se- 
quence is reversed, with changed signs, in the right square, —(1), 
+(2), --- , —(n — 3), +(m — 2), —(n — 1). Replacing negative 
differences by the corresponding separations, we get, (n — 1), 2, 
(n — 8), -++ , (vn — 2), 1, for the left square as well as for the right square. 
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This sequence contains all the odd integers in descending sequence from 
(n — 1) to 1, interlaced with all the even integers from 2 to (n — 2) 
in ascending sequence. Thus, in the first (or any) row of either square, 
each separation from 1 to (x — 1) appears exactly once. By the same 
reasoning as for the case when n is odd, this establishes that the third 
property holds for each constructed Latin Square when 1 is even. 

It can now be shown that the second property holds for each square 
when n is even. It has already been found that the sequence of separa- 
tions in the left square is identical with that in the right when 7 is even. 
The beginning treatment, together with the sequence of separations, 
uniquely determines the order of treatment in any row of either square, 
by reason of one-to-one correspondence of treatments with separations. 
In the left square, there is just one row which begins with a given 
treatment, and the right square contains just one row which begins 
with the same treatment. Consequently, every row in the left square 
is identical to one and only one row in the right square. Then the 
rows of the right square could be permuted to arrive at a square which 
is identical with the left square. These two identical squares are 
balanced when taken together, if and only if each is balanced. Permuta- 
tion of rows does not disturb the balance of the right square, and since 
the two taken together have already been shown to be balanced, each 
must be balanced with respect to the set of all predecessors. 


Graeco-Latin Square Construction. 


A slight modification in the procedure for construction, when n is 
odd, produces a Graeco-Latin Square. This modification was first 
noted by our colleague, John E. Dowd. 

For n odd, proceed in the same way as steps (a), (b) and (c). 

Step (d). In the resulting n X 2n figure, replace treatment numbers 
appearing in the odd numbered columns by Latin letters and the treat- 
ments in even numbered columns by Greek letters. 

Now consider all the n” mutually exclusive pairs of Latin and Greek 
letters in adjacent columns. It will be found that the pairs appear in 
the vertical and horizontal order required to form ann XK n Graeco- 
Latin Square. 
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ON THE STATISTICAL THEORY OF A ROVING CREEL 
CENSUS OF FISHERMEN’ 


D. 8. Rosson 
Cornell University, Ithaca, N. Y., U.S.A 


SUMMARY 


In order to estimate the day’s total catch from a fishery an enumer- 
ator roves through the fishing area interviewing fishermen as he en- 
counters them to determine the number n of fish caught and the time 
texpended. The interviewer is assumed to (i) start his trip at a randomly 
chosen point along a well defined route which completely covers the 
fishery, (ii) choose his initial direction at random from the two alter- 
natives, and (iii) travel at a constant rate of c circuits per day. If the 
catch rate n/t at time of interview is an unbiased estimator of a fisher- 
man’s catch rate for his completed trip and if the fishermen’s move- 
ments relative to the interviewer’s path never exceed the interviewer's 
rate c, then rn/ct, summed over all interviews, is an unbiased estimator 
of the day’s total catch. The unit of time is one day, r is the number 
of times the fisherman was interviewed, and n/t is the catch rate at the 
r’th interview. 

Unbiasedness of n/t implies that the waiting times to first catch and 
from first to second catch are identically distributed chance variables, 
and that all waiting times between successive catches have the same 
expected value. If waiting times are independent, then unbiasedness 
implies that fishing is a Poisson process. 


INTRODUCTION 


Fishing, as every fisherman knows, is a chance process. Skill im- 
proves the chances, but a multitude of unknown factors governing fish 
behavior remain to confound even the most experienced fisherman, 
and for most of us catching a fish is still largely a matter of chance. 
One of the basic factors controlling fish catches, of course, is popula- 
tion size; in turn, however, fish population size may itself be strongly 
influenced through the efforts of the more gifted and more fortunate 
fishermen. In order to maximize the fishermen’s chances insofar as 
they are influenced by this factor, the fishery manager attempts to 
maintain the population size and composition at an optimum level 
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through such practices as stocking fish and regulating the catch. These 
management decisions must be based upon objective information re- 
garding the number, size, and age composition of the fishermen’s 
catch, and one of the more important field techniques which have 
evolved for obtaining such information is the so-called creel census of 
fishermen. 

In the creel census the fishery manager or his representative makes 
direct observations on the fishing process, interviewing fishermen in 
action to determine the kinds and numbers of fish taken and the rates 
at which they are caught. Ordinarily, the fishermen so interviewed 
represent only a sample of the fishermen present, so that the creel 
census is, in fact, a sample census. Moreover, in many types of creel 
census only information on incomplete fishing trips is obtained; that 
is, the fisherman is interviewed while fishing, providing information 
on his fishing trip up to the time of interview, and his fortunes after 
the interview remain unknown to the fishery manager. Such is the 
case in the type known as the roving creel census, in which the enumer- 
ator moves through the fishing area interviewing fishermen as he en- 
counters them, and it is this commonly employed method of sampling 
and its associated methods of estimation which shall be examined in 
some detail here. 

Estimation of the total day’s catch from the fishery on the basis 
of the roving interviewer’s data appears to present a unique combina- 
tion of statistical problems in the theory of sampling and estimation. 
Some distinctive features of the roving creel census are (i) the open 
end to the sample—the number of interviews in the sample is not 
predetermined but depends, rather, upon the number and distribution 
of fishermen present, (ii) the sample of fishermen obtained by following 
some rational route through the fishery constitutes a systematic rather 
than a random sample, (iii) the probability of interviewing any given 
fisherman depends in some manner upon how long he fishes, and (iv) 
only incomplete information is obtained for any one fisherman. 

In examining this problem we shall first specify a well defined, 
roving sampling procedure and then treat the estimation problem under 
the simplifying assumption that catch rate at the time of interview is 
an unbiased estimator of that fisherman’s catch rate for his completed 
trip. Later, we consider the implications of this assumption as it relates 
to the nature of the fishing process. 


THE SAMPLING AND ESTIMATION PROCEDURE 


A specific description of the procedure followed by a roving inter- 
viewer probably would not apply in all detail to any single creel census 
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ever conducted, since each fishery presents circumstances peculiar 
unto itself. The sampling process to be described here is necessarily 
a specific idealization of the general plan of roving through the fishery, 
interviewing fishermen as they are encountered; as always, the idealized 
plan is not actually attainable in practice, but can be approximated to 
a reasonable degree. 

We shall assume that some systematic route which gives complete 
coverage is plotted through the fishing area, that the interviewer 
starts his trip at the beginning of the fishing day from a randomly 
chosen point of departure along this route, that he chooses at random 
one of the two alternative directions to travel and then proceeds at a 
constant rate of travel until the end of the day. 

The line denoting the route of the interviewer effectively reduces a 
fishing area in two dimensions to a line in one dimension, and since 
the route is closed—that is, a complete coverage of the fishing area will 
bring the interviewer back to his starting point—then the line may be 
represented conceptually as a circle. A fisherman’s location then 
corresponds to a point on the circumference of the circle, determined 
by the point on the interviewer's route at which he would pass that 
fisherman's location in the fishing area. The dimension of time may 
be introduced by letting the radius of the circle represent the length 
of the fishing day. In this way a fisherman's location in both time 
and space can be represented by a point within the circle; his location 
in the fishery determines the radius vector upon which he lies and the 
time of day determines a point on this radius vector. It is convenient 
here to regard the time axis as extending toward the center of the 
circle, so that the time is 1 (end of the day) at the center and 0 (beginning 
of the day) on the circumference. For example, if a fisherman is station- 
ary, then his entire trip can be plotted as a segment of a radius vector 
as shown in Figure la; the particular radius vector is determined by 


FIGURE la 


Tre MAPPING OF A STATIONARY FISHERMAN’s TRIP WHICH STARTS AT THE 
BEGINNING OF THE DAY AND CONTINUES Untit Time 7 av THE LOCATION Ih 


his (fixed) location and the segment extends inward from the time he 
starts fishing to the time he stops. 
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The interviewer's trip, in this framework, then becomes a regular 
spiral extending from his randomly chosen starting point on the cir- 
cumference to the center point of the circle. The regularity of the 
spiral is a direct consequence of the assumption of a constant rate of 
travel. This is illustrated in figure 1b where the interviewer’s rate of 
travel is arbitrarily taken to be one complete circuit of the fishery per 
day, and the direction of travel is arbitrarily taken clockwise. 

In the particular combination of circumstances described in Figure 
1b, the interviewer's trip does not intersect with the fisherman’s trip, 


Ss 
& : 
FIGURE 1b 


THE MappinG oF AN INTERVIEWER Trip WHICH STARTS AT THE Point S AND 
Moves In A CLocKWISE DIRECTION AT THE RATE OF ONE Circurr Per Day 


for the fisherman had already left by the time the interviewer reached 
that location. Had the interviewer, traveling in this same direction, 
chosen his starting point anywhere on the are A shown in Figure 1c 


S 
FIGURE Ic 


Tue Rance or INTERVIEWER STARTING Points Wuicu Leap 
TO AN ENCOUNTER WITH THE FISHERMAN at L, Arc A FOR 
CLockwisE Trips, Arc B ror CouNTER-CLOCKWISE. 


then he would have encountered this fisherman, or traveling in the 
opposite direction and starting anywhere on arc B would have led to an 
encounter. Since the probability distribution for the starting point 
is uniform on the circumference of the circle, the probability that it 
will fall on are A (or B) is simply the length of the arc expressed as a 
fraction of the entire circumference. Clearly, this relative length of A 
(and of B) is simply 7, the length of the fisherman’s stay; for in order 
to reach the fisherman’s location L at exactly the time 7 when he 
stopped fishing, the interviewer would have had to start his trip at a 
point S just far enough in back of L so as to reach L by traveling for 
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exactly a time 7, hence covering a 7’th of the circumference. The 
probability that this fisherman would be interviewed is therefore 


P(interview) = P( clockwise travel )-P( interview | clockwise travel ) 
+ P(counterclockwise) - P(interview | counterclockwise) 
= 47 + 3T. 


Thus, if the interviewer’s rate of travel is one circuit per day, then a 
stationary fisherman’s probability of interview is equal to the fraction 
of a day that he fishes. 

If the interviewer’s rate of travel is not 1, then this result no longer 
holds. It is obvious, for example, that, if the interviewer makes 2 
complete circuits per day, then every fisherman who fishes for more 
than half a day is certain to be interviewed at least once, and may be 
contacted twice. Figure 2a, again employing a stationary fisherman, 


2(1-T) =_P(ONE INTERVIEW) 


2T-1 =P (TWO INTERVIEWS) 


FIGURE 2a 


A MappINnG OF A STATIONARY FISHERMAN’s Trip or Leneru T > 1/2 anv 
AN INTERVIEWER’S TRIP AT A RATE oF c = 2 ComMpLETE Circuits Per Day. 


illustrates this situation and indicates the ranges of starting points 
which would result in 1 and 2 contacts between interviewer and fisher- 
man. In order for a single contact to occur here, the interviewer must 
pass the fisherman’s location for the second time between time T and 
time 1. Since he is traveling at the rate of 2 circuits per day, this 
means that the range of starting points which will accomplish this is of 
relative length 2(1 — 7). The remaining range of starting points, of 
relative length 1 — 2(1 — T), will result in 2 contacts between inter- 
viewer and fisherman. Figure 2b illustrates the case of a fisherman 
whose trip length is less than a half day, and indicates the range of 
starting points which will produce 0 and 1 contact. In order for a 
single contact to be made the interviewer must pass the fisherman’s 
location between time 0 and time 7’, and the probability of this occurring 
is 27. We have here ignored the feature of randomized direction of 
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1-2T=P(NO INTERVIEW) 


2T=P (ONE INTERVIEW) 
FIGURE 2b 


A Mappine or INTERVIEWER AND STATIONARY FISHERMAN 
FOR THE CasEc = 2 AND T' < 1/e. 


travel because of the trivial role it plays in the case of stationary 
fishermen. 

In the more general case where the interviewer makes c complete 
circuits of the fishery in a day then a stationary fisherman whose trip 
length is T will be interviewed either [c7’] or [cT] + 1 times, where 
[cT’] denotes the largest integer contained in cT. By the same argument 
employed earlier, the probability of exactly [c7] interviews is 
[cT] + 1 — eT, and the probability of [cT] + 1 interviews is then 
cT — [cT]. The expected number of interviews of this stationary 
fisherman is therefore 


[eT |((eT] +1 —cT) + (eT) + DCT — [cT]) = cl 


for any constant travel rate c > 0. 

If the fishermen themselves are moving about in the fishery then 
the interviewing process becomes somewhat more complicated anal- 
ytically, and apparently unmanageable from the viewpoint of estima- 
tion. We first observe that if the fisherman’s rate of movement relative 
to the interviewer’s path never exceeds the interviewer’s rate c then 
the expected number of interviews remains at c7. To demonstrate 
this we have exhibited in Figure 3a the path of a slow-moving fisherman, 


FIGURE 8a 


ILLUSTRATION OF A Path TAKEN By A Movina FisueRMAN WHO CovERS A 
FRACTION A OF THE FisHERY DuRING « FrRacTION 7 oF THE Day. 
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traveling in a clockwise direction, who covers a relative distance A 
in a time 7’, thus moving at an average rate of § = A/T. As seen in 
Figure 3b, where the interviewer's rate is taken to be c = 1, the range 


aoe 


clockwise route counterwise route 


FIGURE 3b 


ILLUSTRATION SHOWING THE RANGE OF STARTING PoINts OF INTERVIEWER 
Trips Wuticu WILL INTERSECT THE Movine FISHERMAN IN 3a. 


of interviewer starting points which lead to an interview is of relative 
length T(1 — §) for clockwise interviewer trips and relative length 
T(1 + 8) for counterclockwise trips. Thus, the probability of an inter- 
view is T(1 — 8)/2 + T(1 — 3)/2 = T. More generally, we see that 
if, for an arbitrary interviewer rate c, the fisherman’s rate s is uniformly 
less than c, then the probability distribution of the number r of inter- 
views for clockwise trips is 


Pe = ([fe—)) =fe- 9) +1 Te —») 
al Fir 


l| 


[Te— Slee), 
and for counterclockwise trips is 
Pe = (kets) Sie ta4+ t—Tet 9 

= J — Pr 


[Te+s] +), 
and, again, the expected number of interviews is 
&(r) = 48&(r | clockwise trip) + 3&(r | counterclockwise) 
=—20G— 5s) ale 8) 
aol ie 
As soon as the fisherman’s rate s exceeds the interviewer's rate c, 

the roles of the two paths effectively become reversed in so far as they 
- determine the number of interviews. In effect, the hunter becomes 
the hunted, and an interview occurs now if the fisherman overtakes 
the interviewer. Consequently, if a fisherman travels at a constant 


rate s > c for a time 7’ then the expected number of times he will be 
interviewed during this period is s7' rather than c7’. If during a period 
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of length 7 the fisherman's rate s > c¢ is not constant but is s; for a 
time 7, then s, for a time 7. , --- , then s, for a time 7 , 
7 + --: + + = T, then his expected number of interviews during 
this period is §7’, where 


It then follows that, if s varies continuously, s = s(r) > c¢, in the interval 
0 <7 < T, the expected number of interviews is 


AO) = / cin 


For a completely arbitrary type of fishing trip of duration T, the fisher- 
man’s rate of movement s(z7) may exceed c part of the time, and part 
of the time not, so in general 


&(7r) =c ie dr + s(z) dr, 


Je 


where 
5 5507 Ol eee cae, 
Se = fr | Ol or ees ares 


Since there will be only finitely many discontinuities in s(r), we may 
simply write 


&r) =cT + 6 — oT. , 


where 7’, is the Riemann measure of 3, , or the total length of time 
that s(r) > c, and § is the average of s(r) over 3, , 


5 = | 81) dr. 


The two components of &(r), cT and (§ — c)T, , are not individually 
estimable on the basis of the interviewer’s data—unless, perhaps, he 
also makes some quantitative measure of the fisherman’s rate of move- 
ment when he is approached for an interview. For this reason we shall, 
from this point onward, assume that no fisherman’s rate of movement 
ever exceeds that of the interviewer (7, = 0), and this is most easily 
accomplished by imposing the restriction that the fisherman’s time in 
motion is not counted as fishing time, nor is he interviewed while in 
motion. 


At this point it is worth noting that while the arguments so far 
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have been directed at the case of a single interviewer traveling at the 
constant rate of ¢ circuits per day, all arguments apply just as well to 
the case of k interviewers equally spaced along the route and traveling 
at the constant rate of c/k. The combined data of k such interviewers 
is equivalent in every way to the data of a single interviewer traveling 
k times as fast; that is, traveling at the rate c. 

The preceding results apply to each fisherman who is present in 
the fishery sometime during the course of the day; consequently, if 
there are M fishermen present with trip lengths 7, , --- , Ty , then 
the expected number of interviews is c(T1 + -°° tee Lies <a le 
then this is also the expected number of different fishermen interviewed; 
otherwise, the expected number of interviews exceeds the expected 
number of different fishermen contacted by an amount ¢ (1, = We), 
where the sum >.* extends over all fishermen whose effort T’; exceeds 1/c. 

In his interviews the enumerator determines the number of fish 
caught and the amount of effort (time) ¢ expended up to the time of 
interview, thus enabling him to estimate the catch rate for each inter- 
viewed fisherman. Conventional methods of estimating the day’s 
total catch from the fishery by a roving creel census are based upon the 
assumption that this catch rate n/t at the time of interview is an un- 
biased estimator of the catch rate N/T for the completed trip. We 
shall examine the implications of this assumption in a later section; 
for the moment, however, we will accept it. If the fisherman is inter- 
viewed r times during his trip, then r such unbiased estimates of N/T 
are avaliable; since the information is cumulative, however, the last 
interview contains all of the information of those preceding it, so the 
last catch rate would be used to estimate that fisherman’s N/T. Using 
the established fact that the expected value of r, the number of inter- 
views, is 7’, we shall then see that rn/ct is an unbiased estimate OfeNe 
and summing this over all m fishermen interviewed then gives an un- 
biased estimate of the total number of fish caught by all M fishermen 
during the day. 

A simple numerical example shown in Figure 4 illustrates this point. 
Here the interviewer's rate of travel is c = % circuits per day; two 
stationary fishermen are present during the day, one starting at the 
beginning of the day and fishing a fraction T; = % of the day and the 
second, who is 3 of a circuit behind the first on the interviewer's (clock- 
-wise) route, starts fishing at time 2 and fishes until time 2 (Ps 8). 
The first fisherman will then be interviewed either once or twice, 71 = 1 
or 2, andr, = Oorl, giving four different possible outcomes for 7, and r.. 
The ranges of interviewer starting points which produce each of these 
four outcomes are shown in Figure 4 along with their relative lengths, 
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he = P(r,2l,521) 


"72 
Wa P(r=2, 121) 


Sane = P(r,=2,1,=0) 


FIGURE 4 


A NuMERICAL EXAMPLE WITH Two STATIONARY FISHERMEN SHOWING THE 
PROBABILITY DISTRIBUTION OF NUMBER OF INTERVIEWS WHEN 
THE INTERVIEWER’S TRAVEL RATE IS c = 3/2. 


or probability measures. The expected value of [(r.N,/7T\) + 
(r.N.2/T>)|/c in this example is then 


lature oN 1 (ON, a 3 Bey 5 (Ah) | 
a(t (Aa te) ee 1 %,) S16 ere) es are 
Le i (*) a (¥2) 
PES GA Te 16.X75 
2 [21 (8 3 (8 
2) 2 (=), +2 @n,| gitar 


The sampling error of an estimate of this type may be regarded as a 
sum of two uncorrelated components, 


1 rm i! rN 1 ee 
Oe ta ld Pe de EP cae) 
where > ;,,) is a sum over the m fishermen sampled and >),y) is a sum 
over the M fishermen present, the first due to the sampling of fishermen 
and the second due to the sampling of the fishing process of the selected 
fisherman—that is, due to the incompleteness of the fishing trip at the 
time of interview. These components are statistically uncorrelated 
because of our assumption that for any given t, 0 < t < 7, the observed 
catch rate n/t is an unbiased estimator of N/T. Unfortunately, how- 
ever, the sampling variance and its corresponding components are in 
general not estimable from the data obtained in a roving census because 


of the systematic nature of the sample and unequal, unknown probabili- 
ties of selection. 


We turn now to an examination of our simplifying assumption that 
catch rate at time of interview is an unbiased estimator of the catch 
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rate for the completed trip. This assumption clearly relates to the 
nature of the stochastic fishing process, and in the next section we will 
study this relationship in some detail. 


IMPLICATIONS OF THE ASSUMPTION 
OF AN UNBIASED CATCH RATE 


For this discussion we again restrict our attention to a single fisher- 
man on a given day and on a fishing trip of given duration 7’. Since 
fishing is a chance process, the number Nr of fish which will be caught 
during this period is a chance variable. The basic chance variable of 
the fishing process, however, is the amount of time required to catch 
a fish, or the waiting time between successive catches. The number 
N is simply the number of successful waiting times contained in an 
interval of length 7; hence the probability distribution of Nr depends 
basically upon the distribution of waiting times. 

Our fisherman is being subjected to a sequence of waiting times 
W, , W2 , Ws , -** between successive catches, and, since he terminates 
the process after a period of length 7’, then the number N > of fish caught 
will be that number K for which w, + we + ++: T Ux < 
Tew, tw.t--) + we + wee. Thus, if Wx denotes the sum 
of the first K waiting times, then the probability distribution of Nv is 


PUN» SE) = Pe = Wea): 


The waiting times w; , We, Ws, *** are non-negative chance variables, 
so the event N; = K is also defined by excluding the event Wx > T 
from the event Wx: > 7; consequently, an equivalent expression 1S 


PING S-E) = PW ST) HRW > Ts 
or, in the classical form for waiting time problems, 
PW hk) =PWeeD SPWen Ss). (1) 


The catch n, on hand at the time of interview of our fisherman is 
also a chance variable whose distribution then depends on t, T and Nr. 
To compute this distribution we note that the joint event n, > k and 
N, = K occurs if and only if Wier < tand We < T < Wey: ; thus, 


the conditional probability of n, > k, given t, T and N; ,1s 


P(Waer St, We ST < Wess) 
PWee To Wiad) 


SPOpe hele) = 


— PW < t, Wx Te (Wye Wes LD) 


Pie ST) Pe 
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From this we obtain the distribution of n, by subtraction, 
Pa =| LN) 
=Pm>k— 114.7, NN, — 2a hee ae 


This derivation of the distribution of Ny and n, implicitly assumes 
that the duration 7’ of the fishing trip is an arbitrary constant, de- 
termined independently of the outcomes w, , w. , --- of the waiting 
process. While this is possibly true of the fishing process, as when the 
fisherman decides in advance just how long he will fish, we must also 
acknowledge the existence of other possible stopping rules, including 
those which are a sequential function of the waiting times. When we 
impose the restriction of unbiasedness, however, we require that all 
admissible stopping rules exhibit the same dependence on waiting 
times, and, since the rule in which 7 is an arbitrary constant must be 
included among the possible stopping rules, then all admissible rules 
must be independent of waiting times. This is clearly seen in the case 
Ny = 1, where the condition of unbiasedness becomes 


PG, S010, Na = l= PCN 2 alae) =F 
Here we see that the conditional distribution of w, , given that fishing 
stops at time 7’ with N; = 1 fish in the creel, must be the uniform 
distribution from 0 to 7, regardless of the stopping rule. Now if the 
process satisfies this condition when 7’ is an arbitrary constant then 
every other stopping rule for which this condition is satisfied can contain 
no more information concerning w, than does the rule in which iis 
arbitrary. We shall continue to assume, therefore, that 7’ is simply a 
preassigned constant for each fisherman. 

The preceding arguments also assume that the restriction of un- 
biasedness applies conditionally for all t, and we shall now demonstrate 
that this must, in fact, be true if we require that unbiasedness hold 
for an arbitrary travel rate c. To see this, let 


hr.x(t) = e(™ | Ne x), 


so that the condition of unbiasedness becomes 
Ke Tt 
T = | hr, x(t) dP 7(t). 


For a stationary fisherman the distribution of t, the time of the last 
interview, is uniform from T — (1/e) to T when T > (1/c). Since the 
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choice of c, the interviewer’s travel rate, is arbitrary relative to the 
time TJ’ fished by this particular fisherman, then for all c > (1/T) 


it 
= a) hr, x(t) dt, 
T-(1/c) 


or, letting 


rx =f hra(® a, 
then 


= pe es (4) 
Hr) ~ Healt 1) = 5). 
Letting x = T — (1/c) we then have 
K 
Ayr x(z) = Hex) - T sa), 
and upon differentiating with respect to z, 
hr, x(x) = 5 5) 


for 0 < x < T. This justifies our earlier assertion that our unbiased 
condition on n/t implies that 


ie) - 
ct 
for now we may write 
™m r N _ 
e(") = eft Guns 0} 5 So) = N. 


Returning to the distribution of n, , we may express the unbiasedness 
condition as 


= tk 
DUP Ge kN tT Na = K) ae 
k=0 
or, equivalently, 
K—J tk 
DP > bl tT, Nr = K) =p 
k=0 


In terms of the distribution of waiting times this identity then becomes 
K 

> PW. <4, We ST) —- PW, St, We ST) 

aa (2) 


- mn [PW <T) — P(Wesi < T)] 


428 BIOMETRICS, SEPTEMBER 1961 


which must hold for all and T,0 < t < T < @ and for every positive 
integer K (for K = 0 the identity is trivial). 

The identity (2) imposes severe restrictions on the nature of the 
fishing process as it is described by the distribution of waiting times. 
Yor the special case K = 1; i.e., the fisherman’s total catch is one fish, 
this identity reveals several facts. If the cumulative distribution func- 
tion of w, , the time required to catch the first fish, is denoted by F(w,), 


FQ) =P, 2), 

and, if 

Gly | 2) = Pw. < y | = 2), 
then 

He) = / Gle — x |x) dF(2) = P(W, <2) 
0 

and (2) becomes, for K = 1, 

‘ t 

F(t) — [ G(T — «| 2) dF) = 4 (F(D) — HD). 


Differentiating both sides with respect to t we obtain a new identity 


i) — G0 = tO) = 5 RK) — HC). (3) 
Now setting ¢ = 7 we find that 
KT) = RLFC) — HY), (4) 
£0 
P(r = 1) = Ti(Z). (5) 
Differentiating (4) with respect to 7 gives 
nT) = —T#(D), (6) 


that is, the density function of the sum of the first two waiting times 

at the point 7 must be equal to T times the negative derivative of f 

at 7. This, in turn, implies that the density function f of the first waiting 

time 1s monotonically decreasing; that is, if 0 < w < w’ then f(w) > f(w’). 
Combining (3) and (4) we have 


fO0 — G@ — ¢| 4] = 7) (7) 


and now differentiating with respect to 7 we find 
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Hog 4 |) = 1); 
or 
f(w,) g(w. | w:) = —f’(wi + We). (8) 


Thus, the joint distribution of the first and second waiting times, w, 
and w,, must be equal to the negative derivative of the density function 
f, evaluated at the point w, + Ws . Furthermore, upon integrating 
w, out of this joint density function we find the marginal density fune- 
tion of wz to be 


ip f(w,)g(w2 | wi) dv, = -[ f/(w, + we) dw, = f(wr), 


that is, w, and w, must be identically distributed. Note that for fixed w, 
the mean value of w. must be 
E(w. | w,) = [1 — F(w,)]/f(wi)- 


If w, and w, are independent as well as being identically distributed, 
then we see, upon returning to (7) and differentiating with respect to ¢, 
that 


foo — FC — )) + fore -—) =9, 
or, putting ¢ = T, 
(M/C) = —f0) = —8 Gay) 
so that the distribution f(T) of the first and second waiting times must 
be the exponential 
f(T) = 6e°*. 

From (8) we also see that the converse is true; that is, if w, and w. are 
identically, exponentially distributed then they must be independent 
if our condition of unbiasedness is to be fulfilled. 


Most of these arguments and conclusions apply in general to the 
original identity (2) holding for arbitrary K. In general, (4) becomes 


IPWs < Dat = % PW x ST) — PWen SM, 


- go that (5) becomes 


T dPW, <1). 
Tansee 


P(N, = K) = 


The relation (6) becomes, in general, 
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dP Wen TS ey (9) 
aT K! 


where f(T) is the K’th derivative of f(7). Combining these last 
two equations we find that 


P(Nr = K) = (-1)*" aia Bee) (10) 


ai 
for K > 0, and 
PIN; =0) =1— 7 @).: 


From (10) it is seen that the generating function Q,(s) of the distri- 
bution of N, takes the form 


Q,(s) = LP =i) 
= 1 — Fer) + oryry — © per) + oD a) 


which is the Taylor series expansion of 
O79) =e ts) (11) 

about the point 7. The factorial moments of N 7 are therefore 

8N (Nr — 1) -+- (Nr —7 +1) = (-1) TF" ©). 
In particular, the mean value of Ny is 

ur = THO), 
and the variance is 
or = wr(l — wr) — Tf’). 


The earlier argument for K = 1 which led to (8) and the conclusion 
that 


dP(w, <2) = dPw, < 2); 
now gives for K = 
dP(w, < x) + dP(w. + ws < 2) = dP(w, < x) + dP(w, + wv < 2), 
and for K = 3 
dP(w, < x) + dP(ws + w, < 2) + dP(we + ws + wz < 2) 
= dP(w, < 2) + dP(w, + w, = 2) + dP, + wv, + 3 < 2), 


or, in general, 
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K K 
Do dP Wey — Wi De. =) 
k=1 k=1 (12) 


k-1 


SO cel) as arr ACB eke 


The implication for K + 1 waiting times is thus not quite as specific 
as it was for 2 waiting times—namely, that the first two must be identi- 
cally distributed. However, multiplying the system (12) by x and 
integrating does show that all waiting times must have the same expected 
value. 

The conclusions concerning independence and the exponential dis- 
tribution do extend to arbitrary K. If waiting times are mutually 
independent, then f(wi) = f(we) ts exponential, and from (9) we see 
that the distribution of the sum Wg is simply the K-fold convolution 
of f, which implies that all waiting times are distributed according to f. 
As is well known, independent and identically exponentially distributed 
waiting times generate a Poisson distribution for number of occurrences, 
and in this case (10) gives us 


aoe 


P(N; = K) = Toe. 


and from (11) we now obtain the Poisson generating function 

1 JC —») = 1— {1 — Dee es ten ae 
When fishing is a Poisson process, the conditional distribution of n, , 
the number of fish on hand at the time of interview, is binomial with 


parameters N; and p = t/T, for substituting the exponential waiting 
time distribution into our formulas we obtain 


_ K! 1 a: ap) Krk-1 
Pin =k14,7,Ne- X= Slog fe 2)<* de 


Sqm ae ae | = ()(#) (2 a 


Thus, for all t, 0 < ¢ < T, the catch rate at ¢ is unbiased, 


f 4 N 
le ms) = eae) = 


ie ) 
and here the conditional variance of the catch rate estimate is 


var (ns 
b 
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The above variance is conditional upon ¢ as well as 7’ and Nz ; 
since we know the conditional distribution of ¢, the time of the last 
interview, we can in this case compute the value of the second com- 
ponent of variance of the estimator of >) N—that is, we can now 


compute 
lie ‘3 (™ Ne) | 
el? (m) : t lf : 


for fixed T’s and N,’s. Assuming that the different fishermen are 
undergoing independent Poisson processes, we see that 


ib uN Pod ele 
| > (2 up ayo le OMY, ee 


P¢ = [eT]) = 1L—cPf + [eM = 1 —Pr= lt] +1), 


where 


and where the conditional distribution of ¢ given r is uniform. For 
r = [eT], ¢ must fall in the interval (7 — 1/c, [eT]/c) of length 
(1/e)P(r = [cT]) and for r = [cT] + 1, ¢ must fall in the interval 
([eT|/c, T) of length (1/c)P(r = [eT] + 1). From this joint distribution 
of r and ¢ we find the second component of variance to be 


i th, Nev ie [eT] [cT] 
eft =e z ve) = > Wr Wie wen eo 


(m) (M) 


({eT] + 1)? cT [cT]\({ceT] + 1 
sehr OR ra a ( - zal ie i = i} 


As c gets large then the probability of interview approaches 1 for every 
fisherman, and the time ¢ converges to 7, so this component (as well as 
the first component) approaches 0. 

Examples of other stochastic processes which satisfy our condition 
(2) of unbiasedness can now be easily constructed by the simple device 
of mixing Poisson processes. Since unbiasedness holds conditionally 
for a Poisson process with parameter 67 then it must also hold un- 
conditionally when @ is assigned some probability measure PG) abn 
effect, then, the fisherman is assigned a value of 6 for his trip, where 
the @ is selected according to the distribution P(@). The density func- 
tion of the waiting time to first catch is then 


f(w,) = [ ” 967": AP(6). 


All waiting times will have this same marginal distribution, and 
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their joint distribution will be 


foo} 


fw, ++, Ww) = [ gig est 8) IPB) 
v0 


The covariance between any two will be the same as the covariance of 
w, and ws , which can be written in general as 


© co 2 
COV (Ww, , We) = [ x(1 — F(x)) dx — | xf(x) aw} ' 
J0 0 
and which in the present case becomes 


cov (W, , We) = ‘p wie _ Be | evan (). 


The distribution of V7 is simply 


Ris ep | ” P(N» = K | 6) aP(8) 


ET eure 
= =i ae" aPC), 
which, of course, could also be obtained from (10) using this function f. 


Since P(n, | t, T, Nr, @) is binomially distributed independently of 8, 
then this is also the unconditional distribution, 


Pan, | t,1,N1) = H Pn, | Nx , 0) dP(0) 


“(Gr 
Pieri ya fa i 
regardless of the mixing distribution P(@). 


To take a specific example of this mixing procedure, let P(@) be the 
gamma distribution 


eae do. 


I'(a) 
then 
a (zx —(a+l1) 
and 


K-1 T'(a =" K) (1 + Ae 


i: 7 @) = 1) B*T(a) B 
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The distribution of N7 , a gamma mixture of Poisson distributions, is 
from (10) the negative binomial 


A negative binomial process derived in this manner is of no particular 
interest as such, because the basic process which the fisherman follows 
on any given trip is still Poisson. We may now change our viewpoint, 
however, and regard this method of derivation as merely a device for 
showing that fishing can be a negative binomial process and still satisfy 
our condition that catch rate is unbiased. We stand on weaker ground, 
of course, if we know that a fishing trip is a negative binomial process, 
for we cannot then be certain of unbiasedness as we could in the Poisson 
case. The same would be true of any other distribution we might 
generate by this device of mixing Poisson processes. 


AN EXAMPLE OF BIAS 


To illustrate the magnitude of the bias which might arise if the 
fishing process does not satisfy the conditions of the preceeding section 
we may consider a simple example where successive waiting times are 
independent and identically but not exponentially distributed. In 
particular, if we let waiting time w be proportional to a chi-square 
variable with an even number of degrees of freedom, say w = x?2,/28, 
then the computation problem remains relatively simple. For v = 1 ‘ 
of course, we again obtain the exponential distribution of waiting time. 
For v > 1 the waiting time distribution will no longer be a decreasing 
function with its maximum at the origin; rather, it will increase to 
& maximum at the point w = (v — 1)/8 > 0 and then decrease. We 
are thus assuming that the wait between two successive catches is 
more likely to last 10-20 minutes, say, than 0-10 minutes, and that 
perhaps a wait of 20-30 minutes is still more likely than a 10-20 minute 
wait, though eventually the probability reaches a maximum and then 
steadily decreases. This might be the situation, for example, in fishing 
for some of the more voracious species of game fish which tend to over- 
disperse themselves, exhibiting a negative or repulsive contagion in 
their spacial distribution. Catching one of these fish almost auto- 
matically precludes the immediate capture of another. 

For illustrative purposes, then, let v = 2, so that the distribution 
of each waiting time w;, is 


dP(w; < w) = B’we *” dw. 


Then if the interviewer’s travel rate is c < 1 it can be easily shown 
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from the preceding sections that for a fixed T 


(| x, = 1) = 24 
a) aes 1S 66 


re 2) _ 400 + 127 + 658 + 2767 
a 300 + 6087 


and 


Since 0 < 8 < ~~, we see that 


L rn a 5 
5 <e(@ |, =1) <3, 
and 
Lee Ae rm = payee: 
$4 Fe? Ne = 2) <q + 


Thus, if a fisherman catches exactly 1 fish on his trip then his expected 
contribution to the interviewer’s estimate of the total catch is some- 
where between 1/2 and 5/6 rather than the desired 1. A fisherman 
who catches 2 fish may be expected to contribute somewhere between 
4/3 and ~ to the estimate, depending upon how long he fishes. If 
he fishes all day (7’ = 1) to catch the 2 fish then his expected contribu- 
tion is between 1.373 and 1.533 fish rather than 2; if he fishes only a 
very short time, say 7 = .01, then his expected contribution to the 
estimate is somewhere between 1.333 and 108.783, and is more likely 
to be toward the large end since that fisherman’s 6-parameter is likely 
to be large. 

Clearly, if a collection of fishermen are present with varying 7’s and 
6’s, the interviewer's estimate in this case could not be expected to bear 
any particular resemblance to the actual total catch. 


DISCUSSION 


The major weakness of the roving creel census of incompleted 
fishing trips as a technique for estimating total catch is that the bias of 
estimation depends on the basic nature of the stochastic fishing process, 
and this, in general, is unknown. We can only speculate as to the 
nature of the fishing process and whether or not it satisfies the conditions 
of unbiasedness, but in some circumstances the answer is obvious. 
If the fish occur in schools and several may be captured whenever a 
school is encountered, then waiting time to first fish is waiting time to 
first school, but waiting time from first to second fish may be waiting 
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time between catches within a school, and these two are not identically 
distributed chance variables. Or the habit of the fisherman may be 
to visit the more productive locations during the first part of his trip 
and then spend some tine exploring for more. A variety of arguments 
could be put forward for unequal expected waiting times in violation of 
our conditions for unbiasedness, and the resulting bias could be of 
considerable magnitude. 

The only sure way of avoiding this problem in a creel census is to 
use some other technique in place of the roving interviewer, to make 
the creel census distribution-free in the sense of an ordinary sample 
survey method. One effective, distribution-free technique is to station 
interviewers at the access points of the fishery to obtain information 
on completed trips. When no well defined access points exist, inter- 
viewers may be assigned randomly chosen area segments of the fishery 
in which they are to obtain completed-trip information [1]. References 
to these and other techniques may be found in an extensive bibliography 
on game kill and creel census procedures prepared by V. Schultz [2]. 

It should be noted that the implications concerning the fishing 
process which were deduced from the condition of unbiasedness of the 
catch rate at time of interview apply to the general problem of pre- 
dicting the number of events in a stochastic process. If the number 
n, of events in a time-continuous process is observed after the process 
has been in operation for a time ¢, then in order for n,7'/t to be an un- 
biased predictor of the number of events N, which will have occurred 
at some specified later time 7, the process must satisfy the conditions 
described here. 

There are also other applications in which the role of the time 
dimension is played by some other continuous variable; for example, 
in a process where objects are randomly drawn until their combined 
weight first exceeds 7’ pounds and then a sample of these objects is 
withdrawn until the weight of the sample first exceeds f pounds, t < T, 
then (n, + 1)7'/t will be an unbiased predictor of the number N; + 1 
of objects in the T-pound sample only if the weight of individual ob- 
jects is exponentially distributed. An interesting modification of this 
problem arises when the exact weights of the sample and/or subsample 
are known; that is, when the excess weights above 7 and ¢ pounds, 
respectively, are also measured. In the analogous fishing-interviewing 
process it might be possible for the interviewer to determine that 
time t’, t’ < t, at which the n,’th fish was caught, and use of this type of 
information would certainly modify the form and properties of the 
predictor of total catch. Such problems involving additional interview 
information warrant further investigation. 
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ALLOCATION OF EXPERIMENTAL UNITS IN 
SOME ELEMENTARY BALANCED DESIGNS 


J. S. WILLIAMS 
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INTRODUCTION 


In recent years an increased amount of effort has been devoted to 
studies of how best to allocate experimental or sampling units for 
estimating variance components. Concepts which originated with the 
sample survey workers for estimating means and totals are reviewed 
in Cochran’s text [1953]. Crump [1954] and Gaylor [1960] have ex- 
tended these concepts to the estimation of variance components in 
nested and cross-classified designs. 

The usual procedure of allocation is to minimize a set of specified 
variances, often subject to given cost restrictions and a fixed total 
number of units. This can obscure the fact that the fixed number of 
units automatically limits the amount of attention which can be devoted 
to any one component. Also it is superficial to minimize variances 
subject to cost restrictions, if cost is not a major consideration. Achiev- 
ing a predetermined balance of variances of the estimates sometimes is 
more meaningful, but frequently as will be shown, this is not obtainable 
with a balanced allocation. 

The following development illustrates these principles for balanced, 
two-level nested allocations and two-way cross-classified allocations of 
normally distributed variables. The assumption of normality usually 
is Justified for applications where variance component analysis is im- 
portant, such as in genetic and chemical studies. 


GENERAL DEVELOPMENT 


In the simple two-level, nested analysis of variance, there are n, 
primary (or first level) experimental units, n. secondary (or second level) 
experimental units for each primary unit, and N = n,n, units in all. 
The two components of variance which can be estimated are o” the 
within, estimated by the within mean square SSW/n,(n. — 1) = 6?, 
and the among o; = ko” (say), estimated by the difference between the 
among and within mean squares, 


1Part of this paper was written while the author was a graduate student at North Carolina State 
College. 
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[SSA/(n, — 1] — [SSW/m(n2 — 1)] = m6. 


The circumstances of the populations to be sampled may dictate 
that either (2) n, is predetermined and nz is any integer or (it) N is 
fixed and (n, , nz) is a pair of integers such that nun. = N. 

(7) For the assumption of normality, 

2 Ny (N2 — 1d + Nok)? -p (ny == 1) es 


Va Az é : 
ar [é3] ns n(n; a 1)(n2 is 1) ) 


(1) 


In Var [¢?], the variance of SSA/(n, — 1), 


Qa°* ill 2k % 
(n, — 1) E eae a 

dominates for all but very small values of n, and k. For fixed n, , the 
most important term of this variance is k’, which is determined by the 
uncontrolled, physical properties of the population sampled. When k 
is not small, large values of n2 can have little effect on Var [é;] while 
they provide a more reliable estimate of o° than is required. Therefore, 
let us investigate how to choose m2 so that the relative effort allotted 
to the estimation of o2 and o” satisfies a predetermined measure of the 
relative importance of these components. 


If 
_ Var [2] 
Day ale (3) 
where R will be called the relative precision of estimation, then 
R = Ny(N» = 1)(1 aa Nok)” 4. (ny et 1) (4) 


n3(n eb) 


The expression (4) reduces to a cubic equation in mz , which can be 
solved to indicate the correct choice of m2 for a balanced allocation, 
or bounds for the correct solution can be derived in the following manner: 


(a) Approximate & by 
(n, — 1)(m2 — 1) + mek)” +m — 1), (5) 


n(n 0) 


ge 


Subject to n. > 0, (5) reduces to the quadratic equation in 2 


ni(k?) — no[R’ + k(k — 2)] -— @k —-1) = 90 (6) 
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with roots 


[R’ + k(k — 2)]) VIR’ + kk — 2) — 4° — 2k) 7) 
ok 


n(R!) = 


The larger root will be considered because only very infrequently does 
the smaller root supply an applicable solution. Using this root, the 
relative precision actually achieved, R, , is 

Ny il 


= / _ ———————— > 
“ Mm et n(n, — 1) 


Vig 


(b) Approximate R by 
mit, — 1) + mak) 


its CO) 


Rk” — (8) 


Subject to n; > 1, m2 > 0, (8) reduces to the quadratic equation 
n3(nk°) — n.[(ny — 1)R” + nk(k — 2)] — 2,(2k — 1) = 0 (9) 


with roots 
oo qa VAT See 2 
Bs Gps [(m1 Dis as k(h )] 
ibe (10) 
3 Vim — DR” + mk(k — 2) = arch? = 2k) 


Qn,k? 


Again only the larger root will be considered. The precision achieved 
for this allocation, R, , is 


it 


Pie n3(n, — 1) 


—he 

k’’ is the more desirable of the approximations since it utilizes the 
value n, , and it provides a solution only slightly smaller than the 
desired precision. For large values of n, , however, it is readily seen 
from an examination of the interval about the specified ratio, 
1 ny 1 


<n 


Sl oe se = RS, ale 


(11) 


that the result from either approximation provides a relative precision 
very close to Rk. ‘Thus, for large predetermined n, , the solution for n. 
is dependent only on the relative sizes of the variance components to 
be estimated and the relative precision desired for these estimates. 
In fact, n.(R’) and n(R”) are strictly increasing in the range R’, 
Rh” > mk(2 — k)/(n, — 1), which exists for the majority of the situa- 
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tions encountered. Increasing n, above n.(R’’) only will increase fr 
and place more emphasis on the estimation of o” than is desired, par- 
ticularly since o; usually is the more interesting parameter. 

Most importantly, n.(R’) > n.(R’’), so that an immediate check 
is available on whether or not a balanced allocation exists for a desired 
relative precision. If n.(R’) < 2, then a larger relative precision must 
be accepted if a balanced allocation is to be used. From (7) it is seen 
that a real-valued solution of n.(R’) always exists if k > 3. For this 
case, no(R’) < 2 is certain to occur if R < k(k + 1) +3. Whenk < 4 
and the solution is real, i.e. R > k[(2 — k) + 2+/1 — 2 kj, no balanced 
allocation exists if simultaneously R < k(k + 1) + 34, a balanced alloca- 
tion is certain to exist if R > k(3k + 2), and between these bounds the 
existence is questionable. Obviously for large values of k, balanced 
allocations exist only when there is a willingness to place much more 
emphasis on the estimation of o° than Gan 

In the case of predetermined n, , n» = 2 probably will be accepted 
when the balanced solution doesn’t exist since large R indicates only 
that a better estimate of o” will be obtained than that which was thought 
necessary. 

(ii) The more important allocation problem occurs when N is 
predetermined, and n, and n, must be chosen, subject to mn = N, 
so that the desired relative precision R is achieved. 

In (4) the number of primary units n, can be replaced by N/ne , 
and the correct allocation of secondary units satisfies the quadratic 
equation 


n(R + Nk’) — nN[R + ke — 2)] + (NG — 2k) — 1] = 0. (12) 


The desired solutions are obtained from 


NIR + k(& — 2)] 


n(R) = XR + Nk’) (13) 
VNR + kk — 27 — 4(NG = 2h) — 1h + NE) 
= 2(R + Nk’) 
2 a MIRAE 2) 
m(k) = Nond — 2b — 1] (14) 


JNAR Le — DE — 4(NG — 2k) — 1k + Nk’) 
oe NG — 2k) — 1] 


With this solution, it is easy to check whether a balanced allocation 
exists for the k at hand and the desired FR. Consider first the case where 
R + k? — 2k is positive. If k > (N — 1)/2N, n2(R) is obtained from 
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the solution with the positive radical element, n,(R) from the solution 
with the negative radical element. If k < (N — 1)/2N, two possibly 
acceptable allocations exist by the choice of the positive and negative 
radical elements. If k = (N — 1)/2N, the solution, taken directly 
from (12), is no(R) = [4N?R — (N — 1)(8N 4+ 1))/[4NR + (N — 1)’ 
and n,(R) = N/n.(R). If R + k’ — 2k is negative, the result for 
k > (N — 1)/2N holds, but there is no allocation for k < (N — 1)/2N. 

If n,(R) or n2(R) is less than two, intuitively, it seems clear that 
there will be some staggered allocations which will provide more de- 
sirable estimates of o2 and o” than the minimum balanced allocation 
achieved by setting these values at two. 

The same development can be extended to balanced, rows and 
columns, cross-classified designs. Let r be the number of rows, ¢ be 
the number of columns, o? be the variance attributable to the row 
effects, o; be the variance attributable to the column effects, and a; 
be the residual variance. Set k, = o2/o2 and kz = o7/o7 , then 


Var [@] E SiG sowie ee 1] 
7 Li — (1 + rk)? +114 
The proper allocation is obtained from 
Niki (ki —Ak—ke(ke =2)| 
le [Nk — (1 —2k,)] 


rad N°*[ki(ky — 2)R — keo(ke —2)]’ —4N[ — 2k) R— NE |[NIER — (1 — 2k.) 
2[N?R — (1 —2k,)] i 


Var [62] ¢ .) 


(16) 
— Nikky —2)R—k,(kz—2)]| 
A = gba] SVT 


no N*[ki (ky —2)R — ka (hep —2)]’ —4N[ — 2k) R—NEZ)IN-R — (1 —2k,)] 
2[(1 —2k,)R—Nk3] ‘ 


(17) 
The allocation will depend on 


[1 — 2k,)R — Nk] and [NKR — (1 — 2k,)], (18) 


because the signs of the radical must be different in the two solutions. 
Three sets of conditions exist which affect the choice of rand c. These 
conditions are functions only of the physical limitations of the estima- 
tion problem (N, k, , and kz) and the desired relative precision of esti- 
mation (R). Let k,(ki — 2)R — ke(k, — 2) be positive; 

(a) If both elements of (18) are positive, then there are two solutions 
from (16) and (17) which can lead to acceptable allocations. 
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(b) If the elements of (18) have opposing signs, then there exists one 
solution from (16) and (17) for an acceptable allocation. 

(c) If both elements of (18) are negative, there is no acceptable 
allocation. 

When k,(k, — 2)R — k2(k, — 2) is negative, the results of (a) and (c) 
are interchanged. In each case, there is the hazard that the solutions 
for a desired R will be complex values. 

In the examination for possible allocations of the nested, fixed- 
total design and the cross-classified design, it is easier to check for the 
existence of an allocation by substitutions into (13), (14), (16), and (17) 
of the parameters of a specific problem than to give general bounds 
on R. A rough indication of the existence of an allocation can be 
obtained by replacing k, k; , and k, by estimated values. If data from 
a pilot experiment of the nested type is available, the estimate 


, Nini —2fe? 2 
k = = : 7 | 7 Sar pare = 7 | 
N —n, é? nN — n, — 2)) (19) 


where the primes indicate values from the pilot data, is unbiased. The 
estimates of k, and k, have the same form as (19) where 6”? and ¢” 
are replaced by é” or 6/’ and 6’, and (r’ — 1) or (c’ — 1) replaces 
(nt — 1), (r’ — ICC — 1) replaces N’ — nj, and 1/e’ or 1/r’ replaces 
1/n. It is the experience of the author that this check indicates a 
balanced allocation only when the ratio of the estimate of k to the 
desired value of FR is small. 

The existence check also provides an estimate of the desired alloca- 
tion if one is indicated. In these estimated allocations, it is unlikely 
that any of the solutions using £ or the true k will be an integer. No 
investigation has been made as to what should be done in this case, 
but it seems reasonable that if n.(R’) and n,(R”) bound an integer, 
use that integer. If they do not, take the next largest integer; at worst, 
for known k, this will give an estimate of a’ which is better than desired. 
For n2(R), r(R), and c(R), take that one of the integers bounding the 
solution which, when substituted into (6), (16) or (17) most closely 
approaches the specified R. Like the existence check, the estimated 
allocations should be regarded only as crude approximations to the 
desired allocations. 


AN EXTENSION FOR THE JOINT ESTIMATION OF 

p SETS OF VARIANCES IN NESTED ALLOCATIONS 
Frequently, as in genetic work, more than one variate is measured 
on each individual experimental unit. For p such variates it is desired 
to obtain p sets of estimates of the variances (o7 , o.;);2 =1,°°-, D. 
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Since it is often economically unwise to allocate by the variate rather 
than the whole experimental unit, the allocation must be in terms of a 
single measure of relative precision for all p sets of estimates. 

It may be that there is one particularly important variate x;, which 
must provide estimates with a relative precision R;, . Then the allo- 
cation would be decided by using R;, and k;, in the formulae of the 
preceding section. If, however, one is satisfied with an average relative 
precision over all variates measured, let 


3 eee pe (20) 
Then, 
= k; ry pe Ge ’ 
Ny (Ns is | ae on, te) =P ni Ey | ae (n, z 1) 
ie isu? i=1 PD : 
n2(n, al 1) 
Define 


(kt) = DO ki/p, (kh) = 37 i/o. 
7=1 71=1 
The two approximations for case (i) with fixed n, are 


Spee [R’ + (k*) — 20)] + VR! + 0) — 2) FP — 400 — 2(h) 


2(k”) 
(21) 
he [(n. — IR” + n,(k?) — 2n(k)] 
Wp 2n,(k”) 
Be 3 (22) 
=f Vit — DR” + m(k) — 2m(k)}° — 4n(k°) — 2(k)) 
2n,(k’) : 
and for case (ii), 
oe Ni ae) 
SAREE 
(23) 


21k + N(k’)] 


In general the features of the single variate case apply here, except 
it should be noted that (k”) > (k)?. 

There are two drawbacks to using R as a measure of relative pre- 
cision. If any k, is exceedingly large, it precludes the existence of a 
balanced allocation, even though singly each of the remaining p — | 
variance sets could be estimated using balanced allocations. The 
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average R does not incorporate any information about the relative 
importance of the different sets of estimates. A variate of small interest 
may provide estimates with a low relative precision, while a very 
important variate may provide estimates with a very large relative 
precision. 

This generalization to p sets of variances obviously does not hold 
for the cross-classified designs. 


AN EXAMPLE 


In Table 1, an analysis of variance of plant data from a genetic 
experiment with inbred lines is presented. The among lines variance 


TABLE 1 


ANALYSIS OF VARIANCE OF Four INBRED LINES 


Source of variation ID Bie M:S 
Among lines 3 19.03 
Within lines 36 1.02 


o is the genetic variance among the plants being studied, and the 
within line variance, o”, is variation caused by the environment. These 
two variance components determine the coefficient of heritability of 
the character being studied; thus both must be estimated. 

These data can be used to plan two types of experiment for estimating 
o and o. In the first experiment, it will be assumed that 200 inbred 
lines are available for testing and ample land is available for setting 
out any reasonably sized experiment. For the second experiment, 
it will be assumed that there is only enough land for a 500 plot experi- 
ment, but that any number of inbred lines are available for planting. 

To check for the existence of balanced allocations, is estimated using 
formula (19) with N’ = 40, nj = 4,3 = 10, 6” = (19.03 — 1.02) 10 == 
1.80, and 6” = 1.02. The estimate, k, is 1.66. Since i is greater than 
5. it should be suspected that R will have to be considerably larger 


ia) 


than one to obtain a balanced allocation. Let any R value between 


one and five be acceptable. 

_ (i) Experiment 1: A balanced allocation is indicated if R 2 
K(k + 1) + .5 = 4.92. The upper acceptable limit of R does provide 
an estimate of a balanced allocation. Substituting R = 5 and k = 1.66 
into (7) or (10) gives an estimated allocation of n, = 200, m2 = 2; 
an experiment utilizing 400 plots. 
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(ii) Experiment 2: For this experiment R + &? — 2k = 4.44 
when R = 5, and k > (N — 1)/2N; the larger root of (13) is used. 
To the nearest integer values, the estimated allocation is n, = 250, 
lp = Me 


In either experiment, to achieve a more even distribution of the 
variances of the estimates, an unbalanced allocation would have to 
be used. 
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AUGMENTED DESIGNS WITH ONE-WAY ELIMINATION 
OF HETEROGENEITY’ 


WALTER T. FEDERER 
Cornell University, Ithaca, New York, U.S.A. 


INTRODUCTION 


One of the principal problems in plant breeding and in biochemical 
research of new pesticides, herbicides, soil fumigants, drugs, etc., is the 
evaluation of the new strain or chemical. Efficient experimental designs 
and efficient screening procedures are necessary in order to make the 
most efficient use of available resources. In some instances sufficient 
material of a new strain or a new chemical is available for only one or 
two observations (plots). Hence, the experimenter should use an 
experimental design and a screening procedure suitable for these con- 
ditions. In other cases, the experimenter may wish to limit his observa- 
tions to a single observation on the new material. In still other cases 
(e.g., in physics), a single observation on new material may be desirable 
because of relatively low variability in the experimental material. 
Furthermore, it may be desired to combine screening experiments on 
new material and preliminary testing experiments on promising material. 
The experimental design should be selected to meet the requirements 
of such experiments rather than selecting the material and experiments 
to meet the requirements of the experimental design. The experi- 
mental designs described in the present paper were developed to satisfy 
requirements such as those described above. 

The class of experimental designs known as augmented designs was 
introduced by the author in 1955 to fill a need arising in screening new 
strains of sugar cane and soil fumigants used in erowing pineapples” 
(Federer [1956a, 1956b, 1956c, 1958]). An augmented experimental 
design is any standard design augmented with additional treatments 
in the complete block, the incomplete block, the row, the column, etc. 


1Paper No. BU-39 of the Biometrics Unit and Paper No. 341 of the Department of Plant Breeding. 

2G, N. Rao, Sugar Cane Research Station, Anakapalle, Andra Pradesh, India, has used some of 
these designs for field lay-outs of sugar cane (personal communication, 7/19/57). J. G. Darroch, 
Experiment Station, Hawaiian Sugar Planters’ Association, Honolulu, Hawaii, expected that eight 
experiments designed as augmented randomized complete block designs would be harvested in 1957 
and several more had been installed (personal communication, 8/23/57). 
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The construction and randomization procedures will become apparent 
after consideration of a few specific examples. Analyses for some of 
these designs have appeared in the literature (Iederer [1956a, 1956c]). 
The purpose of this paper is to present the general approach for all 
augmented designs with one-way elimination of heterogeneity and to 
present examples of two specific augmented designs, the augmented 
randomized complete block design and the augmented balanced lattice 
design. The general approach for designs with two- and higher-way 
elimination of heterogeneity will be presented in a forthcoming paper. 

Examples of specific designs with additional treatments, unequal 
replication on treatments, or unequal block sizes have appeared in 
statistical literature (e.g., Basson, [1959]; Corsten, [1959]; Das, [1958]; 
Graybill and Pruitt, [1958]; Justensen and Keuls, [1958]; Kishen, [1941]; 
McIntyre, [1958]; Pearce, [1948]; Yates, [1936b]; Youden and Connor, 
[1953]). However, a general approach covering the class of augmented 
designs as well as the others has not been presented. This is done 
in the present paper. 

The randomized complete block design and the one-restrictional 
lattice designs are well known examples of experimental designs with 
one-way elimination of heterogeneity. General methods of analyses 
have been developed for incomplete block designs and for non-orthogonal 
situations (e.g., Bose and Nair, [1939]; Federer, [1957]; Kempthorne, 
[1952]; Nair, [1941]; Rao, [1947]; Yates, [1934], [1936a], [1936b], [1938]). 
Analyses for augmented designs with one-way elimination of hetero- 
geneity are developed along similar lines. 


CONSTRUCTION AND RANDOMIZATION 


The construction of augmented designs with one-way elimination 
of heterogeneity is illustrated with examples. Consider first the aug- 
mented randomized complete block design. Here there are N; = n.; 
plots (experimental units) in each of the 7 = 1, --- , r blocks, with 
the V,; not necessarily equal; there are two kinds of treatments, treat- 
ments repeated 7 times and occurring once in every block and treat- 
ments repeated less than r times (the treatments could appear more 
than r times in the experiment and the analyses still hold) and hence 
occurring in only a portion of the blocks. Tor a large number of situa- 
tions a number v, of treatments will occur once in each of the r blocks 
and a number ?, of treatments will occur once in one of the r blocks; 
for convenience call the former group standards or standard treatments 
and the latter new treatments. The schematic treatment layout for 
r = 5 blocks, v, = 4 standards (A, B, C, D), and v, = 13 new treatments 
(eC, f9) hy 0.) Walenta. On pag ols 
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where the V,(=6 or 7) were made as nearly equal as possible although 
this is not necessarily the best grouping for all experimental situations. 

The randomization procedure for the augmented randomized com- 
plete block design is: 


(i) Randomly allot the v, standards to v, of the VN; = n.; plots 
in each block. 

(ii) Randomly allot the v, new treatments to the remaining plots, 
which is equivalent to randomly assigning the lower case letters 
to the new treatments and assigning the letters in order to the 
remaining plots. 

(iii) If a new treatment appears more than once, assign the different 
entries of the treatment to a complete block at random with 
the proviso that no treatment occurs more than once in a com- 
plete block until that treatment occurs once in each of the 
complete blocks. 


The augmented triple lattice design with v3, = v, = 9 standard 
treatments (capital letters) and 7, = 15 new treatments (lower case 
letters) is used to illustrate the construction of augmented incomplete 
block designs. The schematic lay-out for the treatments Is: 


Replicate and incomplete block number 
1 2 3 
1 2 3 1 2 3 1 2 3 
A D x A B C A C B 
B BE H D E F E D F 
C F I G H I I H i 
j i n p ip t u v Ww 
k m (a) q s — — = x 
Nth 5 5 5 5 4 4 4 5 
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The same procedure may be used to obtain other incomplete block 
designs. For example, a fourth group of treatments which would make 
the above an augmented balanced lattice design, is (for v; = 18): 


Replicate 4 
Incomplete block no. 


1 Z 3° 
A B Cc 
F D E 
H I G 
y Z a 
= | 
| 
1. jh 4 4 4 


The randomization procedure for any augmented incomplete block 
design for v, standards, repeated r times, and v — v, new treatments in 
6 incomplete blocks of size 7. ;, is: 


(i) Randomly allot the groups to the incomplete blocks within 
each replicate. 

(i) Randomly allot the standards within each group to the n.,, 
plots of block jh. 

(ii) Randomly assign the new treatments to the remaining plots. 

(iv) New treatments appearing more than once in the experiment 
should be randomly allotted to the incomplete blocks with the 
provisos that the treatment should not appear twice in any 
one replicate until it has appeared once in all replicates and the 
treatment should not appear twice in one incomplete block 
until it has occurred once in all incomplete blocks. 


Other incomplete block designs (Federer, [1955], chapters XI and 
XIII; Kempthorne, [1952], chapters 22, 25, 26) may be used to set 
up additional augmented incomplete block designs. The procedure 
is as described above for augmented triple and balanced lattice designs. 


ANALYSES 


The generalized form of the analysis of variance for an augmented 
design with one-way elimination of heterogeneity is presented in Table 1; 
the algebraic developments and the notational definitions for this table 
are presented in the section entitled Appendix. (This development 
logically appears in this section, but was relegated to an appendix 
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because of algebraic complexities.) The intrablock treatment means 
and variances and the interblock treatment means and variances are 
presented in matrix form. Likewise, the expected values of the mean 
squares #, , E, and E{ from Table 1 are presented in general form. 
Utilizing these results, the analysis of variance, adjusted treatment 
means, and variances for mean differences are presented for the aug- 
mented randomized complete block design and for the resolvable 
augmented balanced lattice design in the following two sections. 
Analyses for other incomplete block designs may be obtained from the 
general results given in the Appendix. 


TABLE 1 


ANALYSIS OF VARIANCE FOR EXPERIMENTAL DESIGNS 
WITH ONE-Way ELIMINATION OF HETEROGENEITY. 


Mean 
Source of variation df Sum of squares squares 

Total (uncorrected) N... Se mignY ain — 
Correction for mean = CF 1 Viny/ie Boat 
Among incomplete blocks y?, y? 

(ignoring treatments b-1 = ae 

and complete blocks) De aie ical a 
Among treatments (eliminating 

complete and incomplete v—1 yes FiQ:.. ae 

block effects) : 
Intrablock error n... —b—v+1 subtraction E, 
Complete blocks (eliminating 

treatments, ignoring r—1 De er Qs. —— 

incomplete blocks) : pe 
Incomplete blocks (eliminating b—?T 2 L(0; + Bjrn)Q.in Eo 

treatments and complete ’ Oe 

blocks) 7 
Among complete and incomplete aw 

blocks (eliminating treatments ) b-—1 yD 2 (0; 4+ B7,)Q.9, Es 

7 


AUGMENTED RANDOMIZED COMPLETE BLOCK DESIGN 


For the augmented randomized complete block design, it is simpler 
algebraically to look at the normal equations for the effects from the 
intrablock analysis rather than to consider the v formulae given by (3). 
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The discussion here refers to new treatments appearing only once in the 
experiment. If the new treatments appear more than once, it may be 
simpler to obtain solutions from equation (4). The normal equations are: 


ain at Dnt + D6; = Y.., 
r=1 7—2 
FRAC Sey em ahr 1c: 
7=1 
Standard treatments (in r blocks): 
rp + #4.) + 6 = Ya. 
7=1 


New treatments (in one of the r blocks): 


mi = pj se Tj = Visi , 
3 
where 


vr urtuy 


i ese as Ob) 
T=rrt+l 


r=1 t=1 


Solution of these equations for effects results in the following (Federer, 
[1956a] and [1956c]): 


1 


ies ze ee iI ie — 
pi = i Tp PP Fr 4 ae ae e, = Sa, 


7=1 k= 1=1 
(N; — v, = m,; = number of new treatments in block 7) 
p 55 Piss = Mee — p; = Ye . 
Thus, the new treatment means need to be adjusted for the block in 
which they appear. 


The various estimated variances between treatment means are: 
Two standards: 
ViGine = Urn) e200 ieee 
Two new treatments in the same block: 
V(Yiin — Yin) = 2E. @¥#®). 


Two new treatments in different blocks: 


Dy 
34 somewhat simpler solution could have been obtained by taking Sy, #ri = O rather than 
t=1 
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V(Yi;; — Yin) = 2H. + 1/v,) (7 se |pe j se h). 


A standard and a new treatment: 
Vigate OV ee i | 1 : 
Gr: te) lie detec ae a ee (aesahe 
vow <O bv, 


where E, is the error mean square obtained from an analysis of variance 
on the standards alone. 
The following numerical example was constructed for ease of com- 


9 


putation (r = 3,v, = 3.¥, = 2): 


| 
Replicate number 
il 2 3 
You = 9 Y 312 = 6 Ys13 = 12 
| Yuu = 5 Ys2 = 6 Y323 = 10 
| Ys = 7 Y332 = 6 Yos3 = ll 
Yiu = 13 Yiss = 10 m5 
Rep. Total ye 34 Woy Ss P43} Ve SS oe Y =195 
Total for 
Standards | Yea = 21 Y32 = 18 Wi = 74 
Standard Totals Yau S220, Y3.. = 21 VWoye, SP! 


Applying the formulae obtained above we find that @ = 10, 
¥, — i = =|) 745 = Fae = =e, Oy ae 733 = HG, Ta is 
Bh Fie = 2, a = i —2, ps = 3. 


In the analysis of variance table we obtain:* 


Source of variation df ss ms 
Total (corrected for mean) 10 76.5454 —- 
Blocks (ignoring treatment) 2 27 5454 
Treatments (eliminating blocks ) 4 45 WR25 
Error (elim. tr. and bl.) 4 4 1 
Block (eliminating treatment ) ® 42 Pil 
Treatment (ignoring blocks) 4 30.5454 = 
The error sum of squares equals 4 as it should since 64. = +)’ 
éo, = —1, 62 = —1, and €:.. = +1 was used in constructing the 


4The error (eliminating treatment and block) and the block (eliminating treatment) sums of 
squares may be obtained from the analysis of variance on the yields for the standards. 
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example and 1” + (—1)? + (—1)?+ Il? = 4. Also, from the formulae 
for variances, 
Vee. = Vix) = 2) 3 = e2/0, 
ViVi —* Yin) S72 Oe 3) 873; 
V@si. — Yu) = D+ 1/8 + 1/8 — 1/9) = 14/9. 
Likewise, it is possible to partition the treatment (eliminating block) 
sum of squares with v, + v, — 1 degrees of freedom into the following 


orthogonal contrasts:’ 
Among standards (v, — 1 d.f.) 


tr V2 72 
be Vu bani Ey 


1=1 ii rv, 


l| 


Among new treatments within a block (>. n;; — r d.f.): 
r nij nij 2 
>»: OS aos aad ($ Yui) /na — O; 
=1 i=1 


pp 


Standards vs. new treatments in block j (r d.f.): 

ar Dr Nij 2 

De {nu = Ynez — 0; ye r.} /0,%;0, + N1;) 
4=1 


(21 — 3(13))* rt (18 — 3(10))’ 
— «6808 + 1 3(1)(3 + 1) 
Thus, 6 + 0 + 39 = 45 = treatment (eliminating block) sum of squares. 


+ 0 = 39. 


AUGMENTED BALANCED LATTICE DESIGN 


The balanced lattice design, or its equivalent, the balanced in- 
complete block design, is described in various places (e.g., Federer, 
[1955], sections XI-3.3, XI-4, and XIII-2.1; Kempthorne, [1952], 
Chapters 22, 23, and 26). By including additional treatments in the 
incomplete blocks, an augmented balanced lattice design (ABLD) is 
formed. The simplest ABLD is the one formed by including new 
treatments only once in the experiment. If some of the new treatments 
appear in more than one incomplete block, the computations become 
more complex, but the general results given in the Appendix apply. 

For the ABLD, it appears that the restriction d'2, 4, = 0 results 
in a simpler solution than the restriction Si nee ees RL 
Dia 7; = 0 (the first v, treatments are standards and the remaining v, 
treatments are new ones). Using the Sie, 7; = O as the restriction, 
it must be remembered that the differences between treatment means 
are unbiased but that the treatment means themselves are biased 
considering all v treatments as fixed effects with 


‘Suggested by a referee. 
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ae | Le 
=~ Vw... -» == tr, = 0, 

v t=1 v t=1 
where u;.. equals true treatment mean and uw = >> u,../v. In most 
situations, interest lies in differences between means rather than in the 
means themselves. 


Intrablock analysis: 


The intrablock analysis of the ABLD with the standards in a 
balanced incomplete block design for v, = hk, r = k + 1, and 
b = k(k + 1) incomplete blocks of size k, is relatively simple com- 
putationally for the standards in k + 1 replicates and the new treat- 
ments included once. Using the v equations obtained from formula (3), 
add 1/k times the sum of the normal equations for the new treatments 
which appear in an incomplete block with the 7th standard, to the 
normal equation for the 7th standard. Performing the same operations 
on the normal equations for all standards results in a set of equations 
involving only the effects and the yields associated with the standards. 
The resulting equation for the 7th standard treatment after using the 
equation >)*", #; = 0, is 


pass r ; k+l ok 
E vies ke i Sain 0 | 


where 9,.;, equals jhth incomplete block mean on standards only and 
} = 1 = number of times any two standards appear together in an 
incomplete block in this particular ABLD. With estimated treatment 
effects for the standards it is possible to solve for the remaining 


» — v, = v, ?; for the new treatments simply by substituting the 7; 
for the standards and solving the n.;, — k equations in the jhth block. 
Since #; = 9,.;. — 9, (where 9,.;. = jth complete block mean and 


j, = overall mean obtained on the yields of the standard treatments 
alone), the 8;, may be obtained from formula (8) reduced as follows: 


k+1 


kB; a, ji ae z il Y nar > ¥ rinfin 


= Goes, ao Sedu, = KG,.;. pa Me) = OC e 
1=1 


With the additional equation >>*_, 8;, = 0, we find that the solutions 
are: 


A lee oe 
Bin = 7 Qin ) 


which is the usual solution for a resolvable balanced lattice design with 
the parameters v = k’, k = block size, r = k + 1,b = k(k + 1) and 
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\ = 1. The above solutions are used in the computation of the sums 
of squares in the analysis of variance. 

The treatment mean adjusted for complete block and for incomplete 
block effects is @ + 7, and the variance of a difference between two 
adjusted means in this ABLD is one of the following: 


V (difference between intrablock means of two standards 
=p+7;—-a—7, = 7% — 7.) = 207/k, 
V (difference between a standard and new treatment occurring 
in the same incomplete block) 
= (l/h Hh tk DJ —2/k) = oh? 2k — Dyk 
V (difference between a standard and new treatment not occurr- 
ing in the same incomplete block) 
= o(i/k + (ke + & + 1)/k*) = ok + D/F I, 
V (difference between two new treatments in the same incom- 
plete block) 
= {Qk +k + 1)/k’ — 2( + 1)/k} = 20%, 
V (difference between two new treatments in the same replicate 
but not in the same incomplete block) 
= Qo(k’ + k + 1)/k’, 
V (difference between two new treatments not in the same 
replicate) 
= 09 (2k +k+1/k — 2/h) = W(k +h? +k — 1)/FK. 


The n”' in formulae (4) to (6) may be obtained from the above variances. 
The intrablock mean square LH, is an estimate of «7. Hence, the esti- 
mated variances are obtained simply by substituting #, for o2 in the 
above six variances. 

The following numerical example was selected for ease of computa- 
tion. The non-randomized layout is presented below along with the 
computations: 


Replicate I Replicate II Replicate IIT 
i ; Ye, =4 Yao=14 Yin =11 Yor =8 Yia =7 Yoo =10 : 
You=5 Ys12=13 Ys =10 Yso=15 Ys31 =11 Yas2 =12 
You =7 Yin =9 
You=9 
Totals re =25 Yw=27 ¥s,=52 |¥2n=21 ¥»=32 Yo =53 |¥.n=18 ¥.2—22 Ys, =40 
Totals 
standards Y311=9 Y312=27 Ysi,=36 |Y¥321=21 Ys.2=23 Ys2,=44| ¥3.1=18 Ys.9=22 Y3,3, =40 
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ireatment, tovale) Mpa = =o, Pe, Ct, Ve a oor Ye ae 
Veo Yo. =o Grand total. =~ y “= 145; erand total’on stand= 
Sie == We = 820) 
Q,.. = —45/12, Q2.. = —59/12, Op = 82) 12; 
Q: = Q; Oe =s g 12 Om = 3a/ 12; 
Q:. = —20/12, Oa = —4, Os = 4, 
Qn = 0, Qo. — 0, Q 310 —8/3,, 
"30 = 8/3. 
a= 120/12 = 10, f, = (36/4) — 10 = —1, 
el Mae oA era ey ee One e457 12 = 
2, 2) Os SOF ee 4) = 59712 sath 
2, Oi 60 612 20) BO ee 0 81/4 82712 +3 
eee 9.05.) 0) 2 200 M8 0) 0) 0. t= eeOne 
; 1/4 1/4 0 7/4 3/4 1/8 9/12 1 
te Vist sy 208 Oe 3/4 74 178) 433/12 3 
Pal IO. 4 1748 08178) 478) 7/45— 20/12 =) 
edly i 3/4 Oe O00) 208 3.0 i—4 —3 | 
Bis BYE OOP RE ie) 4 3 
GciaeeOm BOs it 0-8".0.2 20 0 PE As0 
Boo Or 0 By Ce AN a 0 0 
oe GO Ont 3/4 09-878 =o 
Seeder Oni 0 Orr 0. 20) - 23/452 8/3 2) 


Using the above, the sums of squares in the analysis of variance 
table for the standard treatments are computed in the usual manner 
(see Federer, [1955], page 342) and are: 
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Source of variation df ss ms 
Total 12 1330 — 
CFM 1 1200 _ 
Replicates 2 8 = 
Standards (ign. blocks) 3 238/3 — 
Blocks (elim. treatment) 3 104/3 104/9 = Ey» 
Intrablock error 3 8 8/30 


The analysis of variance on all treatments is: 


Source of variation df 88 ms 
Total 15 1541 — 
CFM 1 4205/3 — 
Replicates (ign. tr. and bl.) 2 54/6 = 10.8 | — 
Blocks (elim. reps; ign. tr.) 3 4447 /60 — 
Treatments (elim. bl. and reps) 6 Sa AW IVC MP 
Intrablock error 3 8 Si Saw 
Blocks (elim. tr. and reps) 3 104/3 104/9 = EF, 


The various variances of a mean difference between two adjusted 
treatment means are: 


Variance of difference between two standards 
QE GAS) oe 


Variance of difference between standard and new treatment in same 
incomplete block 


I? + 2k — ; = ee = 1) a 
z( Ie —- 3 4 — 14/3. 


Variance of difference between standard and new treatment not in same 


incomplete block 
: : ty’ ; (3) 
p(bA1Y .8(8) <6 
k my 


Variance of difference between two new treatments in same incomplete 
block 


2E. = 16/3. 
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Variance of difference between two new treatments in same replicate but 
different incomplete blocks 


Not applicable for this example. 


Variance of difference between two new treatments in different replicates 


2E(§ +h 2 — 5 _ (8)(8 + 4 t 2— ) - 26/3. 


Interblock analysts: 

The amount of intrablock information is w = 1/£, = 3. In order 
to obtain the amount of intrablock information, we first need to obtain 
the expectation of E, from formula (138). After substituting in the 
various values, a rather surprising result is obtained in that /, in the 
ABLD with new treatments occurring only once has the same expecta- 
tion, o2 + k’o3/(k + 1), as for the standard balanced lattice design. 
Therefore, w’ = k/[(k + LE, — E.] = 2/(3(104/9) — oye LUANG 
With these weights, we now proceed to the computation of the n;, 
and Z,.. from formulae (21) and (ogee iNnusi 7s aeee LAOS ee 
—2.132, r& = 3.132, rt = 440, r§ = 148, 7% = 2.148, 7r* = —2.000. 
The variances of a difference between any two 7* may be obtained from 
formula (24). Thus, the variance of 


998 , 1207 _ ( = = i 
+ 319, 107 1071 


re — T% 


=) = YANG, 


SUMMARY 


An augmented experimental design is any standard experimental 
design to which additional treatments (new treatments) have been 
added to those (standard treatments) appearing in the standard ex- 
perimental design. The additional treatments require enlargement of 
the complete block, the incomplete block, row, column, etc. The 
groupings in an augmented design may be of unequal size. The con- 
struction and randomization procedures, and the general method of 
analysis have been given for all augmented experimental designs with 
one-way elimination of heterogeneity from the experimental area. 
The general results are illustrated algebraically and arithmetically with 
two examples, an augmented randomized complete block design and an 
augmented balanced lattice design. Analyses with and without re- 
‘covery of interblock information are considered. Some discussion of 
unequal incomplete block sizes 1s given. 


6The suggestions leading up to equation (26) were not followed in the computations; the above 
yields approximately the same results for this example. From equation (26) wv’ = 9 (53) /(53(104)- 


23(24)) = .096. 
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APPENDIX 


The generalized analysis of variance is developed below for all designs 
with one-way elimination of heterogeneity. Such designs as the random- 
ized complete block and incomplete block designs which are resolvable 
(the v treatments occur together in a complete block) and which are 
non-resolvable (v treatments occur in 6 incomplete blocks of size k, 
for k < v), are considered. 

To be completely general, let the 7th one of the v treatments be 
replicated r; times in the b incomplete blocks of size n.;,. Let the 
yield of the zjhth observation be expressed by 


Vase = Mahe Fo pp Bn Gain) (1) 
where? = 1, --- ,v = number of treatments; j = 1, --- , 7 = number 
of complete blocks; h = 1, --- , k; = number of incomplete blocks 


in the jth complete block; n;;, = 1 if 7th treatment occurs in the hth 
incomplete block of the jth complete block and zero otherwise;’ n.j, = 
number of treatments in Ath incomplete block of the jth complete 
block; .;. = v; = number of treatments in the jth complete block; 
n;.. = 7; = number of replicates of the 7th treatment; 


2 2 v * kj 
Sater 2 = Le Dali 5 egy Se Te Pp Ors 
= = = = 


wu = a general mean effect = z Dee = oa? Ss 
es 
1 r kj 
where k; = number of incomplete blocks in the ar complete block; 
T; = Wy.. — » = a treatment effect; p; = u.;. — » = a complete block 
effect; Bj, = M.j, — 7. = an incomplete block effect; and «¢;;, are 


random independent effects with mean zero and common variance eee 
These definitions imply >>?_, 7; = ae py = > iG. = OseeOther: 
definitions for the effects are permissible. 


Intrablock Analysts 

The least squares estimates of effects for the above linear model 
(intrablock analysis) are obtained by minimizing the residual sum of 
squares: 


v r kj 
»S, >> Deon = yD oS De Oee ye aie ee Bin)’ (2) 
h=1 ee cee a 


s=1 7=1 


7To be completely general, nij, could be the number of times treatment 7 occurs in block jh 
instead of 1 or zero, but this was not done here. 


464 BIOMETRICS, SEPTEMBER 1961 


Equating to zero each of the partial derivatives of the above residual 
sum of squares with respect to p, Ti 5 A; 5 and @;, results in the following 
normal equations: 
. r r kj A 
is Se Pie >» Nea ee N.irPBin = Vann 
4a=1 j=1 j7=1 A=1 
= grand total, 


f “e ki 
Tit Wee (Bt aie) sate D>) ts.be + Dy NijBin = Yo.. 
j=1 j=1 h= 


1 


= ith treatment total, 
v kj 
pn tee Opie et = tt gales = Y,. 
t=1 h=1 
= jth complete block total, 
Bux Nia Ps Bin) ae De Minh = Vijn 


= jhth incomplete block total. 


In the 7, equation, substitute for @ + A; + B,;, from the 8;, equations 
to obtain: 


ee ys Bere 2E Nints = bea » DS nian = Q,.. (3) 
7 h efile a% 7 A 


When n,.. = rand n.;, = k the above equation becomes: 
mate i 
UB Se k xs Ug 22, De Meinisn =) 


where >); Dox %oinMiin = Agi = number of times the gth treatment 
occurs with the 7th treatment in all incomplete blocks. For balanced 
lattices \,; = \ a constant for 7~ g and the solution for 7, is (for iS 
zero): 


?, = kQ,../(kr — 2d) = kOe — I /rok — 1) 


as given by Federer ({1955], formula XIII-2, where Q.,,, = Q,..). 
For the general case we can add d, >, #; = 0, where d, ¥ 0 for at least 
one value of g, to each of the v equations in the 7; ; adding d, >) 7; to 
each of the equations in (8) results in: 


Texte > Ss > =e Ez, Fi(Nisn = d,) faa OF : 


a im Watis 


We have v equations and v unknowns and the problem is to solve the 
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equations. In matrix notation the v + 1 equations from (8) plus the 
equation >> #; = Ois: 


Pre, —_ Se > ha -> ye ye => Sy) ea 1 
ih Min Nin 
A 
— NainNiin ar Nosh abe. Pe: NojnMoin 
a 0B, Nir ie > 2s DS >> in 


Se 


2 
-D pm yp yt EB 
Ie 1 1 1 0 | 
4 ON SiGe oS Di aver 
72 Oo. = Yoo = 3 A: 


ae OO =e Nee SS Daten 


Again in matrix notation the solution for the 7; are obtained as: 


x 11 12 1 
ay n n ee Tt ie 
21 22 2 
tz |= 12 n te NG 
2 
z, n n tt nr” Oo} 


where n*” are the elements of the inverse matrix. The solution for 
7, 1s 


(i Do , (4) 
i=1 
the variance of 7, is 
VGj= 1, (5) 
and the variance of a difference between 7, and 7, is (Nair, [1941]): 
V4, — 4) = @(n" +n” — ni? — nn"). (6) 


In the p, equations, substitute for @ + 7, from 7; equation, thus for j = f: 


N.¢.P¢ — Siem xu pias. — ee Dy DS neinbin te Di n.nBn 


a 7 h 


= Vee. — De Mt Gs. = Q. >. : (7) 
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The solutions for the p; and 8;, must be obtained jointly since they are 
not orthogonal and since three sets of unknowns are present. Equations 
involving the 7; only are possible here because the incomplete blocks 
in the complete blocks can be considered as b incomplete blocks. For 
the solution we proceed to the 6;, equations and substitute for pf + 7; 
to obtain (for j = f andh = e): 


Nebr + Ba) = ie > pe, NisnlP; + Bin) 


=e hel fe De Mises. = Q 5 
LE 
From these b equations solutions for p; + Bj, are obtained. Summing 
over h, solutions for the 4; are obtained since ae Bi, Omen ie Oais 
sn 


(8) 


less than v, then solve for p; + 8;, ; if not, solve for the 7; . Of course, 
as a check one could obtain solutions for both sets of effects and the 
results must check by satisfying the normal equations. Also, 


1 rp kj r kj 1 v 
zR= a ee pe »s =n 3 mantih 


j=1 h=l a pe Wie FT (9) 
1 v 1 r kj a 
= 7 yy oS ie De aos a Bi}. 
= me Ue FS Sa 
The p* in Table 1 are obtained from the r equations 
D nN; ‘ Tr v . 
SE UD a Das Sey pias Sar = Q.5. (10) 
i=1 CaSO N Olaatl FT 


plus the equation >>%_, p* = 0. These equations are obtained from 
the normal equations for u, 7; , and p; setting each 6;, = 0. 
The expected value of EZ, is az. The expected value of 


r ky; ——— i kj r kr ; 

2 se (p; + Bi)Q.i, = SS SS Qin BD Re, 
7=1 h=1 7=1 h=1 f=1 g=1 

ihfg 


(where :’’’’ are the elements of the inverse matrix in the solution of 


a 
the p; + B;, and where p; and @;, are random independent effects with 
mean zero and variances o, and a3 , respectively) is: 
ae Ue aD BG 2 Z 
se aes jhf —_ — 
zi baa 2 ye ye, S k’ ilies = ¥ minds.) (Yor a ¥ nando. | 
j=1 h=1 f=1 g=1 i=1 d=1 


kj 


5 
= aD > ln = 2n in yy alse 


7=1 h=1 i=1 thw 


. 2 . 
+ Be ye Sam | 
1 


i=1 N.. d=1 %q.. e= 
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r ky k; 
Te m SD ny Kl mata re Sy Meier. 
i=1 Hes 


7=1 A=1 g=l N;. 
h¥g 
v 
NasnMas ae Nai 
= Nig ay: Se 7 ig nN: 
eH, Oe he a > uae 
r kj r kr 
= by ' > : Km] n So eee Sans Sy Resales: 
th if 
j=1 at oe g=1 i=1 es asin ge, 
Th¥fo 
i¥s (11) 
v n D we | 
igh dih oe 
— Nie.Nde. 
i=1 Ni. d=1 Na e=1 ES : 
> kj 
2 jhih| 2 Nith 
“+ AEE oy k E ih Qn. in se, 
7=1 h=1 i=l 7 
ys | 
a Nea 
gai 15... a e=1 = ae 
ES le Ma Mt HD 
= ss jiMs Wie ‘sy dfg!“dih +n SS, Vi fg ith 
Ls -th «fd 
f=1 h=1 f=1 g=1 Aa Weis Jon UB 
thAf9 
—. “1 
ith d 
= 3 = iS Sees 
gai 13.. a=1 Na.. e=1 c=1 


4+ g(b — 1) = Kio, + Ko, + (6 — lo. 


The expected value of >);-1 p*Q.;. is [where a’ are the elements of 
the inverse matrix obtained in the solution of the p* from formula (10)]: 


72 f=1 
f . VP v 
if 
~S(v,.- Eas.) Der(v.- aww) 
j=i t=1 = j=l 
; i E D Aas D N:; 
= oN al'(nt,. — ang BEE DBE De Soy.) 
j=1 i=1 Nj. gai Nie dai Na+ e= 
‘ft ut v v 
‘ Nip Niaz. Nap Maj + 
SS lon ae EP a 
f=1 f=l we | ne d=1 Na+. 
1#sf 


(12) 


| 
S 
M 
i 
= 
= 
—— 
Ya 
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r k 
=F >; oe yp ae 2% Y misctar) 
t=1 23... a=1 Na.. e=1 c=1 
Le Tr 


kj ” Nee Nes ke v nN:: N: 
= Sal ag ee 
h=1 t=1 t.. g=1 i 


i=1 N;.. 


G=l fel 
if 


v v r ke 
ae pee pps >D 3 mtn} 
t=1 N;.. a=1 Nd.. e=1 c=1 
+ or — 1) = K3or + Kyop + (r — Dot. 
Henderson [1953] has obtained expected values for sums of squares 
from non-orthogonal classifications for different situations. 


Now, Do jai Dia or + ew Oia 27-1 p*Q.;. is the sum of squares 
for incomplete blocks within complete blocks eliminating treatment 
effects and has the expectation: 

(K, — Kop + (b -— Noe. (13) 
By definition the coefficient of o; must be zero; hence, K, = K; ; no 
proof of the equality is given here (see Yates, [1938]). It should be 
possible to simplify the coefficient for o3 since there is a relationship 
between the a’ and the k’"”’. Perhaps this should be done prior to 
programming for high speed computers. 

The treatment mean adjusted for incomplete and complete block 
effects is @ + 7;. Only intrablock information is utilized in obtaining 
the adjusted means. The variances of adjusted means and for differences 
between adjusted means are given above (formulae (5) and (6)). 


Recovery of Intrablock Information 


The sum of squares to be minimized is 


ee, 
Ww a ey De Misa Yern = t= pp — 1 — Bay 

rk ue 

ote ie dX ee OB tere h(i acy) a Do Mints) /Min 

where the true weights are w = 1/o? and wf, = 1/(o% + n.,,0) for 
8;, independently distributed with mean zero and variance oj and 
where the estimated weights are” w = 1/62 and wi, = 1/(62 + 1 .;n63). 
Instead of using a different weight for each incomplete block, an average 
coefficient is utilized and is given below. The estimated weight, w’, , 
should be utilized if there is sizeable disparity among the n.;,. The 
resulting normal equations for w/, = w’ are: 


SIt is assumed that variable 7.1, has no effect on the intrablock variance, (see Finney [1956]). 
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ui (w+ w’) yum. + >. 70; + on ah 


+ w 2 5 N.Bin = (W+ MD) ek (15) 


7=1 A=1 


or 


1 a Dat ees ae ea + me > 22M. nBin ae dee 


7 
T, 1 UN,,.7, + w’ Ee oa 5 Mssaltoin 


7=1 h=1 n SEG 


kj 


a (w + w’) 3 Nj. (u i p;) si WwW oz Dy MoinBin (16) 
= apy =F ap" a SS CRUE , 
px ue wns Papi) 3 ny.n 


+ w soa PB ry == 0 LY a (17) 


h=1 


or 
ee tn it) hoes is ee iG nbin = Yai. , 
Bin nal + p; + By) + Do Mints a eae (18) 


Substituting for B;, from equation (18) in equations (15) and (17) 
results in 


fe > pepe ite Yo (19) 
j=1 4=1 
~; ona pe “FP py) =F Do 14.7 Sieret (20) 


Substituting for B;, from (18) and for u and p, from (19) and (20) we 
obtain: 


v r ki 
wy ty = (00 — 10) re Sy Sy Matt = wt BS ny 
t=1 j=1 h=1 -th 7 dis a 


Ti WY. a > Ye tad 4 (21) 
“1p wid 3 So raindl-in = Dnt} = Z,..- 


g=1 h=1 
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The above v equations plus an additional equation, e.g., Se) 
results in unique solutions for the r* ; thus 


fol 12 1 
r¥ Come eset All| 2. 
2 22 2 
720 a Gee ee DRC ma pee (22) 
7k ¢'! (ee C ie 


where the original equations were in the form: 


[ mas UG, 822 Wine, 1 | 7 | Vig) 
N21 Nee N25 1 Tol Zo. (23) 
Nyt Ny2 Noy ce Z, 
il 1 1 0 Ome eeOne! 


and where the c’” are the elements of the inverse of the matrix of co- 
efficients for the 7* . 
The variances of a difference between two adjusted means recovering 
interblock information, say w* + 7* and u* + 7%, is 
V(r% — 74) =e +c” —c” —c’, (24) 
and the variance of r* is: 
VGS t=.) (25) 
It should be noted that of and oj appear in the expected values 
of the Z,.. . 
Now let us return to the calculation of the weight w’ = 1/(¢% + 765). 
n is determined from a nested classification in which there are no treat- 
ment effects (Federer, [1955], page 106). In the present notation the 


expected value of the mean square for among incomplete blocks within 
complete blocks in the absence of treatment effects is: 


1 kj er 
ea, hae 


7=1 h=1 


and hence 
r kj 
where b = )77_, k; . 


The weight w’ for the jhth incomplete block will be (for individual block 
weights) 


Vin = 1/(é == 1. ind) 
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The expected value of the mean square, #7, , for incomplete blocks 
within complete blocks eliminating treatment and complete block 
effects is: 


Po Kay =e ge 


The average amount of interblock information is estimated as follows: 


w’ = m/[nE, — (ai — ME), (26) 
and the intrablock information is estimated as 
w= 1/hL. 


With these weights it is now possible to obtain solutions for the 7% 
in (22). 
Incomplete Blocks Not Arranged in Complete Blocks 


The previous results may be used directly for an incomplete block 
design for which the b incomplete blocks are completely randomized. 
To apply the formulae set p; = 0,7 = 0,k; = 6, n:; = Nor], andr = 0. 
The equations then become: 


Se ea eae bs ex), i = 0. se Dy Ms.75 Se 2d Bs ) 


Vis. 


b > 
n; (ue 5 Ti) ae pm NinPn » Yi= NM n( =e Br) alls De Mints ) 


7 5 
Nit 
MN, By — De A ys NinBr 
1=1 a. h=l1 
b 


Dv v 
N,.T, = epee Wists — yee —z De Nanl.n 


lel Cape 


v 
aa ys NisYi., 
a=1 


and the expected value for blocks eliminating treatment effects sum of 
- squares becomes: 


b : v i v b 
tee re ieee De na) 
h=1 a te 


gai %;. a=1 Md. e=1 


b b v v D : 
250 aos Sn Ue (27) 


hel fal Gand fe, Uae 
hAf 
v b 
2 >> A Sy at me nu} = (b — l)oz + (b — 1)mo; . 
gai ;, a Ma. e=1 


Utilizing these results, the analysis goes through in much the same 
manner as for the experimental design in which the incomplete blocks 
are arranged in complete blocks. 
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If the information from complete blocks as well as from incomplete 
blocks within complete blocks is utilized, the sum of squares 1s 
aw 


Sore Dotty (9; + Bin)Q. jn and its expectation is obtained from equation 
(11) which reduces to (27) above. If the differences among complete 
blocks are random effects, perhaps the weight w’ should be computed 
from E/ instead of EZ, in the analysis of variance table. 
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PHENOTYPIC, GENETIC AND ENVIRONMENTAL 
CORRELATIONS 


S. R. SEARLE 
N. Z. Dairy Board, Wellington, New Zealand. 


INTRODUCTION 


A phenotypic correlation is the correlation between records of two 
traits on the same animal and is usually estimated by the product- 
moment correlation statistic. The genetic correlation, on the other 
hand, is the correlation between an animal’s genetic value for one 
trait and the same animal’s genetic value for the other trait, estimators 
for which have been proposed by Hazel [1943]. Estimates of these 
correlations are widespread throughout the literature of animal breeding 
and in many instances the estimate of a phenotypic correlation is 
reported smaller in magnitude than that of the corresponding genetic 
correlation, e.g. with certain poultry records, in Lerner & Cruden 
[1948], sheep records in Morley [1951] and with certain dairy records in 
VanVleck [1960] and Searle [1961]. Such results may seem a little 
unexpected at first sight since phenotype includes genotype and one 
might anticipate the correlation between phenotypes to be larger than 
that between genotypes. When estimates have not followed this pattern 
the explanation is sometimes given that a phenotypic correlation less 
than a genetic correlation is the result of a negative environmental 
correlation in the records of the two traits. This paper investigates 
the relationship between these three correlations on the basis of a 
linear model, and demonstrates the situations in which this explanation 
is correct. Other comparisons are also made. 


LINEAR MODELS 


Suppose the records of two traits in an animal are « and X. 
Neglecting the general means, we will take each variable as being the 
sum of a genetic term and an environmental (including error) term, i.e. 
and pan ech 

X=G+E. 

The genetic correlation r is the correlation between g and G; that be- 

tween x and X is the phenotypic correlation, R say, and we will define 
474 


(1) 
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al aa ae a 
ry’ as the environmental correlation, namely that between ¢ and L. 
Thus we have 


I 


cov (g, G)/o,06 


=e) 
] 


cov (a, X)/o.0x , 


and 


=. 
I 


: cov (e, E)/c.cn - 


The covariance between g and G is denoted by cov (y, G) and their 
variances are o> and ag respectively, with similar notation for the other 
terms. The phenotypic correlation FR is that between 2 and X which 
can be obtained directly from (1) as 


cov (g, G) + cov (g, #) + cov (G,e) + cove, F) | 
Vie. + 2 cov (g, 2) + aillee + 2 cov (G, B) + of] 


Assuming all genetic-environment covariances (i.e. interactions) are 
zero this becomes 


R= 


_ cov (g, G) + cov €, £) 
~ V8 + ot)(og + oe) 


This reduces to 
A ee (le) (3) 


where h and H are the heritabilities in the narrow sense, of the two 
traits, defined as o?/(o, + o,) and o2/(o2 + op) respectively. This 
relationship between the three correlations, phenotypic, genetic and 
environmental, is derived in Lerner [1950] using the method of path 
coefficients. In both cases genetic-environmental interactions have 


been assumed zero, as is customary in discussions of this nature. 


ENVIRONMENTAL CORRELATION 


Equation (3) can be re-arranged to give the environmental correlation 
as 


w= (R 7 Vin) Vl WU — #2). (4) 


In terms of the model (1) this is the correlation between ¢ and H#, which 
include the random errors. Assuming the correlation between these is 
zero, 7’ can be thought of as the environmental correlation. Phenotypic 
correlations are usually estimated directly whereas genetic correlations 
are derived from covariance analyses between relatives and contain 
additive genetic variation only. Environmental correlations estimated 
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from this formula may therefore contain genetic elements over and 
above additive genetic variation. 

Consideration of (4) shows that when R and r have the same sign 
r’ will be negative if r is greater than R/~/hH. When r and RF are of 
opposite sign 7’ has the same sign as R. Thus the phenotypic and 
genetic correlations being of opposite sign (an infrequent occurrence 
one would imagine) implies that the phenotypic and environmental 
correlations have the same sign. 

Genetic and phenotypic correlations of similar sign is the usual 
situation, and in this case we see that the ratio of the phenotypic to 
the genetic correlation has to be less than the geometric mean of the 
heritabilities, before the phenotypic correlation being less than the 
genetic correlation implies a negative environmental correlation. 
Values of this geometric mean are given in Table 1. 


TABLE 1 
VALUES OF A = \/hH’ 


Heritability of second trait 


Heritability 
of one trait all BO se 540 ROL 6s a eS ee OO eete 
sll .10 
we Pia 20 
3 lee cee OO 
A A SBS BS 20) 
a ee oe ee) aD 
6 .24 .85 .42 .49 .55 .60 
ai cade BN AG) GBS EOD) GS AO) 
8 -28 40° 749° 257 63. 569" 375 580 
29 OO 242) 52) 260) 67s 73) ee Ou Soe oO 
1.0 say) ays 0S A RL CEB) OR LOD) 


1The environmentai correlation is negative when the ratio of the phenotypic correlation to the 
genetic correlation is less than A. 


This shows that for traits with low heritabilities (as is the case with 
many traits of economic importance in farm animals) the ratio 
phenotypic correlation/genetic correlation 

has also to be low before a negative environmental correlation is im- 
plied. For example with heritabilities 0.5 and 0.3 this ratio must be 
less than 0.39, i.e. the genetic correlation must be more than two-and- 
a-half times as great as the phenotypic correlation for the environmental 
correlation to be negative. 
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A phenotypic correlation less than its genetic counterpart, together 
with a small positive environmental correlation, will occur where the 
genes governing two traits are similar but where the environments 
pertaining to the expression of these traits have a low correlation. 
For example, the genes controlling milk production in the first month of 
a cow’s lactation and those controlling total lactation yield may be 
quite highly correlated; but the environments pertaining to the first 
month and to the lactation yields may have a low correlation. The 
phenotypic correlation would then be less than the genetic correlation. 
This is observed in estimates reported by Searle [1961] and VanVleck 
[1960] and also by Lerner and Cruden [1948] for egg production. The 
situation of an environmental correlation having a small value is by no 
means universal and in many instances its value will be large (positive 
or negative) because the environment generally affects an individual 
in all its parts and functions. 

The equation (3) for the phenotypic correlation R can be written as 


R= Ar Br’ (5) 


where A = VhH and B = V(1—A)(1 — A). Because A < la 
negative value of r’ implies & being less than r. Thus a negative en- 
vironmental correlation always implies the phenotypic correlation being 
less than the genetic correlation, although it can be less without the 
environmental correlation necessarily being negative. 

The equation for R when the heritabilities are equal is 


R=hr+(Q —-/Ar’ 


from which it is seen that if any two of R, r and r’ are equal the third 
is also. Thus equal heritabilities imply that equality of any two of 
the correlations is tantamount to equality of all three. 


PHENOTYPIC AND GENETIC CORRELATIONS. 


The relationship of R and 1’ to r can be discussed in terms of equation 
(5), first noting that A + B < 1 because A < 3(h + H) and 
B <i(2 —h +4) arising from a property of geometric and arithmetic 
means. The phenotypic correlation Ff, can only exceed the genetic, 7, 
when the environmental correlation r’ also exceeds 1, and sufficiently 
so; in fact r’ must be greater than r(1 — A)/B. Thus Rk 2r according 
as 7’/r 2 (1 — A)/B, and since (1 — A)/B > 1 there is a small region 
where R < r but 1’ > 7; otherwise k and r’ are less than r together. 
Values of (1 — A)/B are shown in Table 2. An example is for heritabili- 
ties of 0.3 and 0.5, for which the entry in the Table is 1.03; thus, if the 
environmental correlation is less than 1.03 times the genetic correlation, 
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TABLE 2 
1 —-~~VWhH 
V(l — A) — A)? 


Vatuns or (1 — A)/B = 


Heritability of second trait 

Heritability i 
of one trait eal a 3 A & 6 aif 8 A929 C120 

ail 1.00 

2 1.01 1.00 

é [FOS OL 00) 

4 Ol O24 00/ ele O0 

355 OSE OS 03 a OOOO 

6 (E21 OOF IROL 1 O0R 00 

ah Were WPS) eNO) 1 OY 

8 Mall ILO Gl ek il} I LO) ah ee 

9 Preaey UAW iy GY iG) ess 8 ALLOY 1.0e 


*The genetic correlation exceeds the phenotypic correlation when the ratio of the environmental 
correlation to the genetic correlation is less than (1-A)/B. 


then the phenotypic correlation is less than the genetic correlation. 
The values in this Table are close to 1.00 for heritabilities that are 
small or alike, and for practical purposes in these cases both r’ and R 
exceed or are less than r together. But when the heritabilities are un- 
equal the fraction is larger, for example, its value is 1.50 when h = 0.2 
and H = 0.8. 

The equation for R represents a plane in the 3-dimensional system 
having co-ordinate axes for R, r and r’, but the relationships just dis- 
cussed can be illustrated by plotting the line of intersection of the planes 
R = Ar + Br’ and R = k on the plane R = 0, for various values of k. 
This is shown in Figure 1 for the example h = 0.2 and H = 0.8 with 
K = 0.47’, and R < rforr’ < 1.50r. Lines for R = 0.3 and R = 0.2 
are shown intersected by r’ = 1.50r. Below this R < rand above it 
Rk >r. Between r’ = 1.50r andr’ = 1, R is less than r and r’ exceeds ie 
i.e. the phenotypic correlation is less than the genetic but the environ- 
mental exceeds it. Only the first quadrant of the (r’, r) Cartesian 
system is shown, but extension of the lines for R into the second and 
fourth quadrants demonstrates the conditions under which the genetic 
and phenotypic correlations are of different sign, and the environmental 
correlation is negative. 


SUMMARY 


(1) The phenotypic, genetic and environmental correlations, R, r 
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«4 
Above this line R>r / 


y, A 
Between these / 
26 pe inesihr ae 7) 
Y and n> Pa 7 
7 a Below this line 


<3 “a R<2 


ENVIRONMENTAL CORRELATION r! 


22 


| 


GENETIC- CORRELATION r 


FIGURE 1 
PHENOTYPIC CORRELATIONS R = 0.2 AND 0.3 FOR HERITABILITIES 0.2 AnD 0.8. 


and 7’, are connected by the relationship 
R=rVhaH +r’ Vi —AG — A) 


h and H being the heritabilities of the traits involved. 

(ii) The environmental correlation is negative when f and r have the 
same sign only if R/r < WhH; it is negative when Rk and r are of 
opposite sign and F is negative. 

(iii) Equality of the heritabilities implies that when any two of the 
correlations are equal there is equality of all three. 

(iv) The phenotypic correlation exceeds (or is less than) the 
genetic correlation according as the ratio of the environmental to 
the genetic correlation exceeds (or is less than) the value of 


(ld — WhH)/V( — hl — #). 
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RELATIVE EFFICIENCIES OF HERITABILITY 
ESTIMATES BASED ON REGRESSION 
OF OFFSPRING ON PARENT" 


B. B. Bouren, H. BE. McKean, 
Population Genetics Institute, Purdue University, Lafayette, Indiana, U.S.A. 
AND 
YukKIo YAMADA 
National Institute of Genetics, Misima, Japan 


INTRODUCTION 


The problem of optimal estimation of the coefficient of regression of 
offspring on parent (in the sense of minimum variance), when the 
number of progeny per parent is arbitrary, was completely solved by 
Kempthorne and Tandon [1953]. Prior to this paper, two methods 
were (and still are) commonly used: (1) the regression of the phenotypic 
mean of all offspring of a given parent on the parent’s record; (2) the 
regression of offspring on parent, in which the parent’s record is repeated 
for each of its progeny. Kempthorne and Tandon’s technique, which 
they refer to as (3) the weighted regression technique, assigns weights 
to the progeny means which are functions of the number of progeny 
and a guessed value of a correlation coefficient p between deviations 
from regression associated with two progeny of the same parent. The 
difficulty here lies in the fact that p is unknown. The success of the 
general technique depends upon guessing p accurately. Presumably 
if the guessed value of p is close to p, the weighted technique is close to 
optimal. The precise effect of a poor guess for p does not seem to be 
known. 

The purposes of this paper are (1) to investigate the nature and 
magnitude of the correlation coefficient p, and (2) to compare the 
efficiencies of the various techniques with respect to data from a popula- 


tion of poultry. 
THEORY 


General: Following the notation of Kempthorne and Tandon, the 
general model for the regression of offspring on one parent would be 


1Journal Paper Number 1659, Purdue University Agricultural Experiment Station. 
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Yor br PCG — eee (1) 

where 

Y;, = the phenotypic value of the kth progeny of parent J, 

X, = the phenotypic value of parent J, 
average phenotypic value of the offspring population, 
average phenotypic value of the parent population, 

e;, = the deviation peculiar to the kth progeny of parent j, 
and 


6 
From such a model Kempthorne and Tandon showed that 
Hex) = op(1 et ole 
EE jx€;%°) — oP( pr al B’), 


= 
iS 
ll 


the regression coefficient of Y on X. 


and 

(pi > B°)/(1 aa Bs 

where p, is the correlation between progeny having a common parent 

(X,). This result is completely general when op = Var Y,,) = 

Var (X;) and would apply to situations in which non-additivity existed 

in the underlying genetic model and/or if correlations existed between 

the environments within progeny groups. In such a case, p, could be 

considered as the repeatability between members of a progeny group. 
In order to simplify the considerations to follow, a base population 

having certain characteristics is assumed. 


p 


(i) Only additive genetic variance is present. 

(ii) Effects of environment are random, so that the environmental 
correlation between individuals in a progeny group is essentially 
zero. 

(iii) Random mating is practiced. 


In the classical linear regression model, the e;,, form a set of mutually 
uncorrelated random variables. That such is not the case in this instance 
can be clearly demonstrated, even in the completely additive situation. 
If we knew the true breeding values (genotypic values in the additive 
genetic model) of both the sires and the dams, and mating was random, 
we would use the model 


Vij. = by + 3(9; =) Se ACE oy. 1) “Onn sta erie (2) 
where 


g; = the true breeding value of the parent of interest, 
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g: = the true breeding value of the other parent, 
6;;, = the individual genetic deviation from the mean of the 
sire and dam due to segregation, 
€;;, = the random environmental deviation, 
and 


uw = EQg;) = E@:). 


Under the assumption (i) of additive genetic variance, the total 
genetic variance og is entirely additive; hence it is clear that 
Var (g;) = Var (g;) = o¢ » 
and Var (4;;.) = og/2, 

Var (ae) = On 5 

This model (2) meets the conditions of the classical model in that 
all random components are uncorrelated. In the application of this 
model, however, only the phenotypic value of one parent is known, 
while the progeny phenotypes center on the parent’s breeding value. 
To illustrate, consider that with purely additive gene effects, h? may 
be considered as the regression of the parent's breeding value (in this 
case genotypic value) on the parent’s phenotypic value, so that the 
estimate of the breeding value of the parent based on its phenotype is 


(Gn (Oh) h'(X; =i) (3) 
but the true breeding value of the parent would be 
(gg, -m =h(Xi -~H +43; (4) 


where g; = the breeding value of the parent, X; = the phenotypic value 
of the parent, and 7; = the discrepancy between the estimated and 
actual breeding values. It is important to note that the progeny of the 
parent will vary around the point determined by (4) and not around 
the point determined by (3). It is this fact which causes the correla- 
tion between the errors of progeny from the same parent. 

Therefore, in applying equation (2) we must replace (g; — m) by 
(4), yielding, since 6 = h?/2 under the assumptions, 


Ve [se aie R(X; i‘ Lu) G af; =z 3(Gi ia L) A Oess + sin (5) 


Comparing (1) and (5) we see that the term in (5) comparable to 
e;, im (1) is 


A = 1 teas ACE — p) + dix etna (6) 
From (5) and (6) is obtained the model 
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Y six = By 53 R(X; ee H) Tie ijk » (5’) 


which is virtually identical to (1). The subscript 7 in (5’) is actually 
superflous, since the effect of the sire is embedded in e;;, , so that 
dropping the subscript 7 yields model (1). 

Since E(e;;,) = 0, it follows from either (6) or (4) that HU;) = 0. 

Also, the variance of [; would be 
EGY == (1h oa 

Special Cases: I. If we now make the further simplifying assump- 
tion (iv) that no two progeny of the given parent (the dam) have the 
same parent of the opposite sex (sire), we may rewrite (5) as, 

Yin =e BA Be) el oe a ae) eer Gree (7) 
Now from (6) or (7) we see that, 

Ee.) = 20 — W)og + 406 + 406 + on = (1 — */Do? , 
and 

Een) = 2EC;) = 20 — h’)og : 
Then 
p = [Cov (in , ex)]/-V Var (jx) Var (jx) 

[21 — h’)og]/(l — h’/4)op = (h’ — h')/(4 — hi’). 
This result is exactly analogous to the result of Kempthorne and Tandon, 
for under the assumption (i)-(iv), their p, = h*/4 and B = h’/2. 

II. It often happens, especially in poultry breeding operations, that 
matings are made up so that a sample of s sires are each mated to a 
distinct random sample of dams so that d; dams are mated to sire 7. 
In order for our assumptions to meet this situation it is only necessary 
to replace assumption (iv) by a new assumption to the effect that, (v) 
all progeny of a given parent of interest (the dam) have the same parent 
of the opposite sex (sire). We may now consider Y as a function of the 
breeding values of the sire and dam, assuming completely additive 
gene effects, such that 

Yin = wy + 3(g; — w) + a Ose =) ere Oise Ar seeine (8) 

which is analogous to (2). Clearly the expected progeny mean for a 
given sire depends upon the breeding value of the sire, or 

EY six | 1) = wy + a(9: aoe (9) 


It is always the case, however, that only the dam’s phenotype 
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(X,;) is known, hence from (8) and (4) we may write 


2 


Yin =u th (Xu that daten, (0) 
or 
Ven = ts + 0X —O +ee, (11) 
where 
ey = 41, + One 1 Sem (12) 


This clearly shows that the genetic interpretation of the coefficient of 
regression of offspring phenotypic values on the dam’s phenotypic value 
is the same, regardless of whether the dams are mated to a random 
group of sires or to a single sire. On the intra-sire basis, 6 is estimated 
within sires, one estimate for each sire, and a linear combination of 
these estimates is taken as a single point estimate of 6. From (11) we 
see that for a given sire 2, 


Ee. | = 0, 
Ee. | 4) = EMTs) + ECS) + Ein) 
= o3(4 —h’ — h’)/4. 
This variance is less than the variance of e,;, from model (7) because 


ce’, does not contain an effect due to sire. The covariance is the same 
under both assumptions since from (12) we see that 


EC) ixCi in | 2) am oG(1 ae h’)/4. 


The correlation p* between the errors of progeny of a given dam for a 
fixed sire would then be 


ot = [od(l — )/Al/lo2(4 — W — ¥)/4) = WY /(4 —W — hi). 


This result is again equivalent to that obtained by Kempthorne and 
Tandon, since 


p* = (p* — B*”)/(1 — B*), 


where, under the assumptions including (v), p% is the correlation between 
progeny of the same dam in the conditional population of progeny of 
a given sire, and equals h?/(4 — h?), while 6* would have to be the 
correlation between parent and offspring in the conditional population 
or ?/V/4 — h, since cy , being the conditional variance within sires, 
is less than 2. For comparison, p* can also be evaluated in terms of 
the parameters under the assumptions including (iv), in which case, 
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p= (p: aa B)/ ae Las By 

Magnitude of p and p*: The quantity p or p*, depending on the 
mating structure, is the intrinsic parameter upon which the Kemp- 
thorne and Tandon weighted regression technique is based. Therefore, 
it seems important to consider the possible values which p or p* may 
assume, in order to be able to choose the most efficient method of 
estimating the regression of interest, or, if the weighted technique is 
used, to enable an enlightened ‘‘guess”’ as to its value. 

Examination of the values of p and p* in terms of h” shows that 
p = p* = O if, and only if, h”> = 0 or 1. Conversely p or p* > 0 for 
0 <h’ <1. Elementary analytical techniques show that p reaches a 
maximum value when h? = .536, at which value p = .067. Similarly, 
p* achieves an absolute maximum when h’ = .586, when p* = .079. 
The functional relationship between p and h’ is illustrated in Figure 1. 
It is recognized that these may be minimal values for p and p*, since 
failure of assumption (ii) in particular will inflate these correlations. 
The effect on p of relaxing assumption (i) is not clear. The important 
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FIGURE 1 
THE ReLationsHip BETWEEN p AND h?. p = (h? — h*)/(4 — ht) 


EFFICIENCIES OF HERITABILITY ESTIMATES 487 


point is that these values would be unlikely to be large especially in a 
poultry population, where environmental correlations would be small 
or non-existent. In some larger species such as dairy cattle, such 
environmental correlations could have sizable values. 

A primary advantage of the offspring on parent regression technique 
is that unbiased estimates of h” may be obtained when the parents are 
selected for the trait under consideration. It is of interest to determine 
the effect of selection of the parents on the correlation between the 
errors and indirectly on the variance of the estimated ’s. It has already 
been observed that a general expression for p under any mating structure 
would be 


o=(m —P)/1 —17)-= [on — B(ex/enI/[1 — Box/cy)], 


where p; is the phenotypic correlation between progeny of the same 
parent and r is the correlation between parent and offspring, 8 being a 
constant. It is observed that for special Cases I and II, 


p: = (Cov (Vix 5 Yix))/Var (Yn) = [Box + FEU») Vor 
= S'(ox/cy) + 21 — h’)(oG/cy). 
Substituting this value for p, in the preceding equation yields 
0 = [20 = W(o2/oD1/ll — B%(o2/e2)] = 3 — W)ea/(or — Fox) 


It is clear that as the usual uni-directional selection occurs on the parents, 
the values of both the numerator and denominator will decrease. To 
evaluate this change it is now necessary to evaluate o, for a specific 
mating structure. For illustration assume a hierarchal structure in 
which 


of = Boz +i — hog + 306 + on. 
Substitution of this value for o7 in the preceding equation yields 


A 11 — W)og we ii ae 
fa ees sol tom ber Se 


It is seen that while «2 may be much smaller than oy , due to selection 
of parents, the value p* is independent of the value of ox. Similarly, 
in special Case I where each progeny of the dam has a different sire, 
it can be shown that the value of p is unaffected by selection of dams. 

Comparison of the three estimation procedures: All three procedures 
for estimating 6 are of course unbiased. The variances of the estimates 
are the prime consideration. Tor any linear unbiased estimator of 8, 
in which 
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B= > DAL; — De) oe wa, = 2); (13) 
the variance will be 


a= o(1 — MD wile, — PA +a T)mV(X wie, — 27, 04) 


; 


where 


& 
I 


D; w;2,/ > W; ; le e/( a p); 


ee Dy Yir/N; , a Var (€;x), 
k 


and 


w; = a weighting factor applied to the information from the 
jth dam. 


If a hierarchal mating structure is used, p would be replaced by p* and 
T by T* = p*/(1 — p*) in this discussion. 

Kempthorne and Tandon showed that the minimum variance re- 
sulted when 


w,=n/l+n,7), 
in which case the variance (14) reduces to 


a3, = o (1 =< PDS WD Ce its (15) 


This is the optimum weighting applied to the Kempthorne-Tandon 
weighted regression technique or method (3). However, the value of 
T is never known exactly so that it is necessary to guess a value for T, 
which may be indicated by 7. The weights then are 


w; = n;/(1 + 7;7), 


and the variance (14) reduces to 


He ae fea CAROL e 
en ma se. oh?” 


and 


ls 


7 nN; 


where 


t=) ( Pani (an: 


This value is larger than (15) since the weighting is not optimum, 
but will approach the minimum variance (15) as + approaches 7’. 
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Method (2), or the technique in which the dam’s record is repeated 
for each progeny record, is in fact a special case of the Kempthorne- 
Tandon technique in which 7 is set equal to zero. In this case w; = 1; 
and when these weights are used in (14) or 7 considered to be zero in 
(16) the variance becomes 


3, =o0(1 — p) Dad +a2@,-B/[Lne -DT, (1) 
in which 7 = pa n;X;/ Di N; - 


In method (1), in which the progeny means are regressed on the 
dam’s record, w; is simply set equal to one. From (14) the variance of 
this estimate is 


d= 0-9) DYE @,-aL@-aT, 08) 


and @ = >, z;/d, where d is the number of dams. 
It may be noted that ifm, = m2 = ***M% = N, then (14) reduces to 


og = ofl +m — Iel/m 2; — (19) 


and is the same for all three estimation techniques. 

The minimum variance occurs when 7 = +. Consequently if 
T = 0, then the repeated dam technique (method 2) will yield the 
minimum variance. If T # 0, then the weighted technique (method 3) 
will approach the minimum variance provided the estimate 7 is close 
to T. Since the relationship between h? and p is clear (granted the 
genetic assumptions), an intelligent guess for r can usually be made by 
applying prior knowledge. 

The variances of the three estimates of 6 are also considerably 
affected by the distribution of the n, values. A detailed discussion of 
this point will be presented in a subsequent paper. 


AN ILLUSTRATION 


Kempthorne and Tandon found little difference between the vari- 
ances of the three estimates in their illustration involving dairy cattle 
data. As they point out, this result is not surprising since the average 
n was only 1.39 and the estimate of p; was actually negative. Poultry 
data involving larger numbers of progeny should yield more reliable 
estimates of the parameters and more precise estimates of the variances 
of the three estimated 6’s. Data are available consisting of five genera- 
tions (progeny populations for 1952-56 inclusive) of White Leghorns 
previously described in detail by Yamada, Bohren and Crittenden 
[1957]. The trait considered is percent production to January 1, trans- 
formed to angles. The average n per dam over the five years is 7.1, 
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TABLE 1 


Estimates or PERTINENT POPULATION PARAMETERS AND NUMBER OF PARENTS 
AND PROGENY OBSERVED IN EacH OF FIVE YEARS. 


Year pi* Bi iT p* No. sires No. dams Average 7 
1952 064 .128 .059 .0557 10 106 yo 
1953 026 052 .029 .0282 10 92 9.5 
1954 .0572 .096 O51 0484 AGH 78 7.3 
1955 L06SS moO ODO MEEEO OZ 18 108 7.9 
1956 0404 .089 .0384 .0327 20 132 5.4 
Ave. 05138 .100 .046 8.0434 — —_ foil 


varying from 5.3 in 1956 to 9.5 in 1953 (Table 1). Since the sires were 
mated for the season to a random group of dams the regressions are to be 
estimated on an intrasire basis. Consequently interest will center on 
estimating p* , p* and 7*. Analysis of variance in the hierarchal model 
was used to derive estimates of the variance components for sires 
(c?), dams (c?) and progeny within dams [o5 = o (1 — p)]. From these 
components were derived estimates of p, (the full-sib correlation in this 
example) and preliminary estimates of h’, for the purpose of estimating 
T*, which in turn, is needed in selecting the appropriate weighting 
factors. These estimates were obtained from the variance components as 
pt = 6a/(6a + 6), 
and 
hy = 462/(6; + ¢2 + 60). 

Each annual value of h? so derived was considered as a preliminary 
estimate of 28 so B; = h?/2. The estimates 8, and s* were then used 
to obtain the estimates 7* and f*. The results are shown in Table 1. 
All values are relatively consistent from year to year and none of the 
values for é* are outside the expected range based on the theoretical 
additive genetic model. 

Since the value of 7* is unknown, the estimated value 7* derived 
from the data for each year was used in the formulae pertaining to 
the variances of the estimates. To approximate the minimum variance 
under the weighting technique (method 3) + was set equal to 7*. For 
the repeated parent technique (method 2), 7 was assumed to be zero. 

Estimates of 6 were obtained within each sire group in each year 
by each of the three estimation techniques. The variances of regressions 
in each sire group were estimated by use of the appropriate formula 
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(ie. 15, 17, or 18). The individual sire estimates of 8 were pooled over 
sires in years, using the reciprocals of the estimated variances as weights 
for the corresponding regression coefficients, to obtain a point estimate 
of B for each year. The estimated variance of the single point estimate 
of 6 for the year is the harmonic mean of the variances for each sire 
in the year divided by the number of sires in the year. Thus 

ie meas af a2 \ 7 a2 

@ = ¥ 6./63)/D 0/48), 

2 a 

and 


63 = 1/2, 1/6). 


To estimate heritability (h”), the estimates of 6 were doubled. The 
estimates of h? and the estimated standard errors for each of the three 
methods of derivation are presented for each of the five years in Table 2. 
It is clear that in each year the estimated standard errors of the re- 
gression estimates based on progeny means (method 1) are the largest. 


TABLE 2 


HerrraBrity EstimaTES AND STANDARD ERRORS, BaAsED ON REGRESSION 
CoEFFICIENTS ESTIMATED BY THREE DIFFERENT Metruops. 


Method 
Year 1 y 3 
1952 156 + .242 317 + .204 .281 + .202 
1953 — .006 + .171 — 073) ==. 166 — .058 + .166 
1954 .333 + .304 .294 + .297 .304 + .296 
1955 110 + .172 .098 + .169 .102 + .169 
1956 .309 + .216 .340 + .198 .342 + .198 


There is little difference in the efficiencies of methods (2) and (8) in 
these data. Unless the data to be considered have larger values of p 
or p* than observed in the present set of data, there appears to be little 
advantage in using the weighted technique (method 3) in preference 
to the repeated parent technique (method 2). 
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D. J. Finney, Editor 


164 NOTE: On a Formula for the Estimation of the Optimum 
Dressing of a Fertilizer 


F. PrmeEnTEL GoMEs, 
University of Sao Paulo, Sio Paulo, Brasil. 


If Mitscherlich’s law 
y == A[l rs 1071 


adequately represents the yield y of a crop to which z units of a nutrient 
have been applied, the optimum dressing x* (Pimentel Gomes [1953]) 
is given by the expression 


Awe - 
t loge 


z* = (1/c) log b. (1) 
Let x, be a standard dressing of a nutrient and u the response to it; 
formula (1) can be written 


Pees CL, 
(1/e) log (1 — 10 “u) loge 
where, as before, w is the unit price of the crop yield, and ¢ the unit price 
of the nutrient. This formula can be simplified through the expansion 
of its first term, as we shall show. 

Let 


pelo) loe es 


2 = cx,/loge, 


Chi ae Za wt 
10° °"u) log e a (1/c) log il = Cn 


Y = (i/o) log (e= 


We have 


dY 1 1 
aa aol} reciges | log e. 


But it is known (Cramer [1946] p. 123) that 


2 4 
Serene = cite Taney A 
reget 9 t Bot Baa t 5) 
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where B, , By, --- are Bernoulli numbers. This power series converges 
for | z| < 2z, and therefore can be integrated with this restriction. 
We obtain: 


os Re a eat se Sages 
¥ = (1/e) log (2 B, 519 = ey 414 i 
whence, 

Y = 2,[(1/2) — (1/24)(ex./log e) + (1/2880) cx./log Oi eee 


The last series is alternating. If we keep only the first term, the error 
committed is less than (x,/24 log e) cx, , that is, less than cx/10. 
If two terms are kept, the error of the approximation is less than 


(x,,/2880) (cx,,/log e)”. 


Evidently, in applications, the first term gives a rather good approxi- 
mation suitable for most cases, and this first approximation is inde- 
pendent of the parameters of the curve. To show how good this approxi- 
mation is we give in Table 1 exact and approximate values of Y for 
the standard dressing (z,) of 60 kg/hectare, which is suitable for most 
cases in practice. 


TABLE 1 
Values of c Values of Y (kg/hectare) 
2 SS os ee ee 
(hectares /kg) Exact Approximate 
( ea ee Pe tet ae 

0.0020 29.4 30.0 
0.0050 28.3 30.0 
0°0090 26.9 30.0 
0.0100 26.6 30.0 


In most cases the approximate value 
a* = (1/2)2, + (1/c) log (wu/te,) , (2) 


exceeds the true value, formula (1), by less than 5 percent. 

From (2) we conclude also that, when the additional income (wu) 
produced by the nutrient applied is equal to the additional cost of 
fertilization (tz,), the optimum dressing is still approximately (4)au - 

Let us suppose, now, that 2* will be estimated with the aid of a 
value of c obtained from a large group of previous experiments. Then 
we may take c as constant and obtained from either formula (1) or (2) 
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dx* = (log e/e) ae ; 
u 
hence an estimate of the variance of x* is 
23° 
= gs 
V(e*) = (log ¢/e)* = 


where the response wu is supposed to have been estimated by the differ- 
ence between two means of r replications each. It is interesting to 
note that this estimate of V (x*) is independent of the unit prices w and t. 
On the other hand, if uw and ¢ are estimated with data of the same 
experiment, then it is easier to use formula (2), which gives 


du «dé wu 
ee Ay eee Met oe 
dx (log e/é) - 2 log ae 
hence 
A\2 At Wu : 
V(a*) = (log e/ué) Vu) + (1/e \(tog wz V@) 


— 2(log e/é’) (10g wt) Cov (u, é). 
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EKAMBARAM, S. K. The Statistical Basis of Quality Control Charts. A 

12 Manual for Business and Factory Managers. Bombay and London: Asia, 
Publishing House, 1960. Pp x + 96. 16 Tables and 14 Diagrams. Rs. 6.50, 
16s.6d. 


L. R. SHenton, University of Manchester, Manchester, England. 


This short introductory account of control theory and practice is intended for 
managerial and technical personnel, and also for initial courses at university level. 
After an account of frequency distributions leading to the normal law, the control 
chart for quality is described, and then the control of fraction defective, concluding 
with remarks on acceptance sampling. In describing the underlying statistical 
ideas Professor Ekambaram is unusually lucid and achieves considerable success, 
although at times the terminology is unconventional. 

It seems unnecessary to have evaluated separately and in detail the first four 
moments of the Normal, Binomial and Poisson distributions; in any case little use 
is made of wu; and ps. On the other hand there is scant reference to the normal 
probability integral, and few values are quoted. The use of .Cr for the usual binomial 
coefficient (7) may be confused with n times C, , and the printing of > 2«/N as 
1/N >> x seems unfortunate. The formula for a normal probability density is 
incorrectly printed on p.21. The printing is sometimes tenuous and certainly 
lacks uniformity. 


LIEBERMAN, GERALD J. AND DONALD B. OWEN. Tables of the 
13 Hypergeometric Probability Distribution. Stanford: Stanford University 
Press, 1961. pp. vi + 726, Part I: 6 Tables; Part II: Tables; Appendix. $15.00. 


R. A. Brapury, The Florida State University, Tallahassee, Florida, U.S.A. 


This book of tables gives a comprehensive tabulation of the hypergeometric 
probability distribution together with discussions of typical applications. In the 
notation of the book 
p(x) = p(N,n,k, x) =kin\(N — k)"N — n)'V/(k —2)\(n — x)laIN\UN —k —n+2)! 
where max [0,n +k — N] <2 < min [n, k]. P(x) is used as the cumulative dis- 
tribution. Symmetries are noted that permit reduction of the volume of the tables. 
Part II of the book and the Appendix consist of tables. The main section of tables, 
pp. 33-627, is a tabulation of p(x) and P(x) for N = 2,n = 1 through N = 100, 
n = 50. The second section of tables, pp. 628-705, shows values of the same quan- 
tities for N = 1000, n = 500. The third section of tables, pp. 706-713, gives p(x) 
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and P(x) for N = 100, n = 50 through N = 2000, n = 1000 with k = n — 1,7; 
n = N/2. The Appendix consists of values of log N! for N = 1, --: , 2000. The 
computations were effected on an IBM 704 computer and sufficient checks were used 
to insure the six-place accuracy given for all values of p(x) and P(x). 

Part I of the book deals with comments and definitions relative to production 
of the tables, applications of the tables, and approximations to the hypergeometric 
probability distribution. The discussion of applications is clear, concise and useful. 
The examples given are Applications to a Sequential Procedure, Applications to 
Tests of the Equality of Two Proportions, Applications to the Distribution of the 
Number of Exceedances, Applications to the Binomial Distribution (Bayesian 
Prediction), and Applications to Sampling Inspection. A bibliography containing 
sixty-six references will be useful to users of these tables. 

Tables of the Hypergeometric Probability Distribution will become an important 
reference volume and one recommended to all working in areas of relevance. 


PILLAI, K. C. §. Statistical Tables for Tests of Multivariate Hypotheses. 
14 Manila: Statistical Center, University of the Philippines. 1960. pp viii + 46. 


M. J. R. Huary, Rothamsted Experimental Station, Harpenden, England. 


Most of the significance tests in common use can be regarded as special cases 
of the F-test for the equality of two estimated variances. The corresponding 
multivariate test would be one for the equality of two variance-covariance matrices, 
and the natural requirement that the result of the test should not depend on the 
scales of measurement implies that the test criterion must be some function of the 
latent roots (eigenvalues) of the two matrices. Denoting the matrices by S: and Sz», 
all tests so far proposed have in fact been based on the latent roots of the “ratio” 
SiS;° or of $:(S; + $2)". Wilks’ A criterion is equal to the product of the roots, 
i.e. the determinant of the matrix, while other authors have suggested using the 
sum of the roots or the largest or smallest root. 

Pillai’s tables are for use with the last two criteria. Table 1 gives the upper 
5% and 1% points of the largest latent root for s, the number of measurements, 
equal to 2(1)6, while Tables 2 and 3 give the same percentage points for the sum 
of the roots of $:(S; -+ S2)~! and of S:S2" respectively for s = 2(1)8. The last 
criterion is equivalent to Hotelling’s 73. Significance levels of the largest latent 
root have been tabulated very fully by F. G. Foster and D. H. Rees for s = 2, 3 and 
4. Otherwise, the tables appear to break new ground. A short introduction includes 
several examples of multivariate problems to which the suggested tests can be applied. 

The user of the tables is not given all the help he might expect. The parameters 
are not as convenient as the degrees of freedom in the Foster-Rees tables, and the 
parameter values make interpolation awkward. No guidance on interpolation is 
given in the introduction, and there is no indication of the accuracy of the tabular 
entries although these are based on approximations to the true distribution functions. 
In tables 2 and 3, the 5% and 1% values for a given s do not appear at a single 
opening of the book. 

In spite of these shortcomings, the tables ought to be widely used, if only to 
provide experience on which to base further work. Several questions cannot at 
present be answered. In particular, no guidance can be given in choosing between 
the three alternative tests. Another open question concerns the robustness of the 
tests to non-normality; analogy with the ordinary F-test suggests that they may be 
unduly sensitive in this respect. 
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YATES, F. Sampling Methods for Censuses and Surveys. 3rd Edition. 
15 London: Charles Griffin and Co. Ltd. 1960. Pp xvi + 440.54s. 


M. D. Mountrorp, The Nature Conservancy, London, England. 


This book is still being read after ten years since its first edition and has thus, 
according to the modern definition, achieved the status of a classic. 

This, the third edition, is a reprint of the second edition with an extra chapter 
on the use of electronic computers in the analysis of censuses and surveys. The 
value of high-speed computers in large-scale survey work is now unquestioned; 
Dr. Yates’ exposition of their workings and the timeless character of the earlier 
chapters brings this book right up to date. 

The new chapter begins with a concise description of the main features of a 
computer and of the principles of constructing a programme of machine orders. He 
then presents a general programme for the analysis of survey results. This same 
general programme, with slight modifications, will also serve to instruct the computer 
to analyse the results of many different types of sampling schemes, including the 
simple random. stratified, ratio, regression and multiphase methods. The exposition 
is clear and unvarnished, though the reader may be dazzled by the simplicity of the 
unified treatment of the different sampling methods. 

As astandard manual for the practical survey worker this book, to my knowledge, 
still has no equal. 


ABSTRACTS 


The following are abstracts of papers presented at meetings of the 
British Region on February 28 and April 18, 1961. 


P. D. OLDHAM (M. R. C. Pneumoconiosis Research Unit, Liandough Hos- 
753 pital, Penarth, Glamorgan). The Distribution of Arterial Pressure in the 
General Population. 


Blood pressure measurements of the general population form smooth distribu- 
tion curves whose means and standard deviations vary with age and differ between 
the sexes. No other common factors influencing arterial pressure to a major extent 
have been discovered, nor does it appear, from second surveys of the same samples, 
that the distribution of change of pressure will materially depend on simple, common 
factors. The interpretation of these distributions raises the problem, occurring 
in all fields of medicine, of distinguishing normality from abnormality. The tendency 
is for unsatisfactory and arbitrary rules to be adopted for this purpose, rules which 
ignore the evident fact that abnormality cannot be diagnosed from the result of a 
single test, since the innumerable factors influencing function must be mutually 
correlated. 


L. R. TAYLOR (Rothamsted Experimental Station, Harpenden, Herts.). 


rs A Power Law Transformation for Aggregated Populations. 


The individuals of any species affect each other in many ways. The total effect 
of this attraction or repulsion appears, in the spacial distribution of the population, 
as a departure from the statistically simple ideal of randomness. The variance is 
affected and some powerful analytical technique are inapplicable. In set experiments, 
where the mean varies only 100 or 200% between treatments this can be overcome 
by a transformation devised for the occasion and not necessarily very effective at 
other population densities. To be effective in field work, where means may cover 
6 or more log cycles, a transformation must have a sound basis. 

Such a system of transformations has been found, empirically, which fits all 
data so far available. It derives from the hypothesis that variance is proportional 
to a fractional power of the mean (s? = am’). Considerable evidence supports 
this; only 1 out of over 30 populations examined shows appreciable deviation. The 
index b appears to be much more specifically stable than the factor a which varies 
with sampling method, population trend etc., (which may be very local e.g. increase 
by reproduction). It is suggested that b is a specific Index of Aggregative Behaviour, 
present in all individuals, possibly influenced by environment but independent of 
population density and trends. 

The transformation function is ¢(m) = Q i) ml? dm where ¢(2z) is the trans- 
formation for individual counts, Q is a constant and m and b are derived from the 
power law for variance. a disappears in transformation which therefore remains 
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the same for the same species with different sampling practices in the material so 
far examined. (For further details see Nature (1961), 189, 732-5.) 


G. HARRINGTON (A. R. C. Statistics Group, School of Agriculture, Cam- 


755 ‘ ‘ : 
bridge). Studies of Visual Judgments of Quality in Bacon. 


The series of experiments to be described studied the manner in which ex- 
perienced and naive judges handled their rating scale when visually assessing rela- 
tively simple characteristics of bacon. The sorts of attributes involved were the 
“proportion of lean to fat’? on a cut surface revealed when a bacon side is cut into 
two halves, and the component features of this. Most experiments were carried 
out using photographs, in some cases mailed to the judges, although a few involved 
actual bacon sides. Balanced incomplete block arrangements were used so that the 
average quality of the various batches judged could be varied systematically. The 
analysis was concerned with the relative importance of variations in scores intro- 
duced by alterations of “judging standard” (position on rating scale) from batch 
to batch, judge differences etc., and the interrelations between various scores and 
measurements which may have inffuenced them. 


J. M. TANNER and M. J. R. HEALY (Institute of Child Health, Gt. Ormond 
756 St., London, W. C. 1. and Rothamsted Experimental Station, Harpenden, 
Herts.). Assessment of Maturity from X-rays of the Wrist and Hand. 


The bones of the wrist and hand pass through a number of distinguishable 
stages of growth during the period between birth and adulthood, and their overall 
state of development may be taken as a measure of the individual’s level of physical 
development. This is often done by way of an assessment of “skeletal age’. A 
scoring system for deriving skeletal age from an X-ray will be described. 


The following are abstracts of papers presented at the meetings of E.N.A.R. held 
at Cornell University, Ithaca, N. Y., on April 20, 21, 22, 1961. 


757 W. H. BEYER and R. E. BARGMANN (Virginia Polytechnic Institute, 
Blacksburg, Virginia). Symmetrical Complementation Design. 


This design is intended for those experimental situations where the total amount 
of three treatments is a constant. The levels of the treatments are referred to a 
common unit of measurement, and are equally spaced. Certain cell entries are 
omitted to insure complete exchangeability of the three treatments. The usual 
additive model is assumed. This design differs from the usual types of design, in 
that the number of estimable contrasts is limited. Estimable functions in one 
treatment only and in two treatments are presented for the general case of p levels. 
Several methods are employed in order to obtain estimates of the treatment effects 
under various constraints. These estimates are rather meaningless quantities, as 
it is only when they are combined in estimable functions that unique results are 
obtained. Sums of squares and test statistics are presented for the various 
estimable hypotheses formulated. This paper shows that the “hypothesis of sub- 
stitution” is one of the most important to consider. If accepted this says that 
applying one treatment at a low level and another at a high level does not produce 
results which are different when the two treatments are interchanged. Indication 
of how one might consider response functions for single treatments is given. The 
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analysis is also extended to an analysis of covariance and then further to a multi- 
variate analysis. Recommendations for interpretation and statement of limitations 
are made in detail. 


A. E. BRANDT (Statistical Section, Agricultural Experiment Stations, 
758 University of Florida, Gainesville, Fla.). The Analysis and Interpretation of 
Half-Replicate Experiments. 


An IBM 650 program for analyzing the data from a factorial experiment in- 
volving not more than 8 factors or independent variables or from an experiment 
which can be arranged in factorial form is presented. Of the 8 factors, 6 must have 
less than 10 classes and the remaining 2 may have 10 or more. 

The 650 may be used to design a half-replicate experiment, that is, to designate 
the treatment combinations to be used, and to analyze the results. The output 
consists, in the case of a (2) half-replicate experiment, of (2)" — 1 cards. Of these, 
2n — 2 contain the information concerning variances. These cards occur in pairs, 
one member of each being called an alias, on the basis of variances. 

The data from a (2)%(4)? experiment presented by W. H. Horton, Westing- 
house Electric Corporation, to a seminar July 14, 1960 were analyzed by this program 
and the results submitted. A question is raised as to the interpretation of results. 
The effect of temperature level proved to be highly significant but one wonders 
if its alias, in this case a high order interaction involving the other 2-level factor and 
2 levels of each of the 4-level factors, is to be ignored. 

The results of a (2)8(4)? field experiment over 4 years were analyzed and the 
results presented with both identifications given for each separate sum of squares. 
The question is again raised as to which member of a pair is to be accepted. 


BYRON WM. BROWN, JR. (School of Public Health, University of Minne- 
759 sota, Minneapolis, Minnesota). Some Characteristics of the Spearman- 
Karber Estimator in Bioassay. 


Let the dose-response function in quantal assay be a distribution function with 
mean p. An experiment for estimation of u involves n subjects tested at each of 
the dose levels 7; = x + id, i = 0, +1, +2, --- , where n, d and x are fixed. The 
Spearman estimator is defined for this infinite experiment and shown to have finite 
mean and variance. Maxima are obtained for the bias, taken over all placements, 
xo , of the dose mesh, for (i) all distribution functions and (ii) all unimodal dis- 
tribution functions with specified maximum slope. These maxima are compared 
with the sequence of bounds obtained using Euler-MacLaurin formulae. The 
mean square error of the Spearman estimator is given for x) randomly chosen over 
(0, d). The minimum mean square error of the estimator, for random choice of 
xo and fixed n’ = n/d, occurs when n = 1 and d = 1/n’. As n’ becomes infinite 
the estimator is consistent. The asymptotic variance of the estimator is defined 
and used to define asymptotic efficiency relative to the information. The only 
symmetric dose-response function, with » as a translation parameter, for which 
the Spearman estimator has full asymptotic efficiency is the logistic distribution. 
There are distributions (with first moments) for which the Spearman estimator 
has asymptotic efficiency arbitrarily close to zero. High efficiencies are computed 
for the parametric models commonly used in bioassay. When the scale parameter is 
unknown the asymptotic efficiency of the Spearman estimator is at least that for the 
case of scale parameter known. 
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RICHARD G. CORNELL (The Florida State University, Tallahassee, Fla.). 
760 Tables of Sample Sizes and Applications for Estimating some Monotonic 
Functions of the Ratio of Two Independent Poisson Variates. 


The calculation of the efficiency of a vaccine or of an air sampling device is 
an example when it is necessary to estimate monotonic functions of the ratio of 
two independent Poisson variates. Confidence limits on these efficiencies can be 
obtained by using the approach presented by Bross in “A Confidence Interval for 
a Percentage Increase,’ Biometrics (1954). It is also possible to use this approach 
to complete the sample size necessary to attain a confidence interval of predeter- 
mined length for any true efficiency. Sample sizes computed in this manner are 
tabled and applications are illustrated in this paper. 


JAMES E. GRIZZLE (Dept. of Biostatistics, Univ. of North Carolina, 
761 Chapel Hill, N. C.). Asymptotic Power of Tests of Linear Hypotheses Using 
the Probit or Logit Transformation. 


The statistic for testing the fit of a model, or the statistic for testing a linear 
hypothesis under the model, when using probits or logits, has a central x2-distribu- 
tion for large samples if the null hypothesis is true. If it is not true, the test statistic 
has, asymptotically, a non-central x2-distribution with a non-centrality parameter 
that depends on the alternative hypothesis, the model and the transformation. 
Non-centrality parameters associated with tests of the two types of hypotheses 
are derived, and some cases of interest in bioassay when the response is quantal 
are examined. 


JOHN GURLAND (Mathematics Research Center, U. S. Army, Univ. of 
762 Wisconsin, Madison, Wisc.). Some Bioassay Techniques for the Determi- 
nation of Minute Residues. 


A fortification procedure, whereby known amounts of the Standard material 
are added to weak test preparations, is suggested for biological assays involving 
housefly mortality and employing the ‘lm method” and “topical method” respec- 
tively. According to the former method a thin film of toxicant is distributed on 
the walls of the jar in which the insects are exposed. Increasing doses are admin- 
istered by increasing the volume of fortified extract employed in distributing the 
film of toxicant on the walls of the jar. According to the latter method, a constant 
volume (one microleter) is applied to the mesonotum of each fly exposed. This 
requires a separate fortification for each dose. By maintaining a constant ratio of 
toxicant added to toxicant present it is possible to apply the tests of linearity and 
parallelism and also to obtain an estimate of relative potency. Practical considera- 
tions in preparing solutions of desired strength (although the potency is unknown 
and must be estimated), are suggested, whereby a trial value of the relative potency 
is employed in obtaining the final estimate. 


763 DEWEY L. HARRIS (Iowa State University, Ames, Iowa). A Monte Carlo 
Study of the Influence of Errors of Parameter Extimation Upon Index Selection. 


The theory of genetic selection indexes is such that, with knowledge of certain 
genetic and phenotypic parameters, the index which will yield maximum genetic 
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improvement may be chosen. However, in practice, these parameters are not 
known exactly and estimates are used in index construction. The inaccuracies of 
estimation result in indexes which will yield progress somewhat less than the maxi- 
mum attainable progress. The errors of parameter estimation also lead to inac- 
curacies of estimating the progress from selection on a particular calculated index. 

Parameter estimation from analyses of variance and covariance among traits 
of individuals classified into paternal half-sib groups was considered. The magnitude 
of the mean decrease in progress, the tendency to over- or under-estimate progress, 
and the accuracy of estimating progress were evaluated by ‘Monte Carlo” sampling 
procedures and by the development of approximate equations for various combina- 
tions of the true genetic and phenotypic parameters and amounts of data used for 
estimation. The results for these situations indicate that data involving at least 
1000 individuals are necessary for construction of a reasonably effective index. 
However, the accuracy of estimating progress is not very accurate with this volume 
of data. 


H. O. HARTLEY (lowa State University, Ames, Iowa). Analytic Studies 


he: of Sample Surveys. 


Analytic studies in sample surveys are concerned with estimating and comparing 
the mean-characteristics for certain sub-sections of the population called ‘domains 
of studies’. The first part of the paper is concerned with formulas for the estimation 
of the totals and means of characteristics for all the units in a domain. Formulas 
of the variances of the estinates are also provided. 

The second part of the paper raises the question of ‘optimum design’ for analytic 
studies and formulates this as a problem of minimizing the survey cost subject to 
tolerances for the variances of domain comparisons. Non-linear programming is 
applied to a simple special case in which domains are strata. 


THEODOR HEIDHUES (Department of Animal Husbandry, Cornell Uni- 
765 versity, Ithaca, New York). Relative Accuracy of Selection Indices Based 
on Estimated Genotypic and Phenotypic Parameters. 


Empirical sampling techniques were used to investigate the effect of errors of 
parameter estimation on the accuracy of the selection index method. Under the 
assumptions of multivariate normal distribution of genotypic values and phenotypic 
observations and no genotype-environment interaction, samples of various sizes 
from two classes of underlying distributions were generated by an electronic com- 
puter. Genotypic values and phenotypic observations were computed such that 
expected values, variances and covariances of generated variables were equal to 
the respective underlying population parameters. 

The first class of problems was concerned with indices which include phenotypic 
observations on an individual and its relatives in the same trait. The covariance 
matrix between genotypic values of relatives can be inferred from knowledge of 
the genetic mechanism and need not be estimated. The measure of the relative 
accuracy of an index based on estimated as compared to true parameters was taken 
to be the ratio of realized to expected correlation between genotypic value and its 
estimate by a particular index procedure. The decrease in accuracy due to use of 
estimated parameters depended upon the underlying ratio of genotypic to total 
variance and sample size. Evidence is strong that full utilization of genetic knowl- 
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edge of the population structure increases the accuracy of an index procedure. 
Variables with low partial correlation with the genotypic value to be evaluated 
and high correlation with other variables in the index equation should be discarded 
from the index in certain cases. An index based on an estimated covariance matrix 
yielded almost identical accuracies when applied to the same sample or to different 
samples of the same population. 

The second more general class of problems included indices based on phenotypic 
observations of the trait under selection and on a genetically correlated trait. The 
decrease in accuracy depended strongly on the ratio of genotypic to total variance 
of the selected trait. If estimates of elements of the phenotypic or genotypic co- 
variance matrix are “unreasonable”, i.e. if they exceed theoretically determined 
limits, they should be modified to increase the accuracy. The occasionally used 
practice of assuming the genotypic covariance between two traits to be zero cannot 
be generally recommended. 


BE. H. LEHMAN (Statistical and Computing Laboratory, Purdue University, 
766 Lafayette, Ind.). The Peculiar Variance of the Estimator, of a, the Scale 
Parameter of the Weibull Distribution. 


Assume N units selected randomly from a population whose life span follows 
the Weibull density function, F(t) = 1 — exp [—(¢”%/a)], t > 0, M, (the shape 
parameter) known. Observe the failure times of the first few units and stop the 
test when R have failed and time 7 has elapsed. The maximum likelihood estimator 
é of a derived from this test possesses some baffling properties. If R and T happen 
to be fixed and small, the variance of @ considered as a function of N, decreases 
at first, then increases for awhile, and later decreases monotonically. Thus it 
appears that for a certain interval of N, a small sample gives more information 
than a large one. The reason for this is that if r, the actual number of failures 
(> R, small) exceeds R by only a small integer, or if d, the actual test duration 
(> T, small), exceeds T by only a small period, the 4 then employed is badly biased 
and variant. For small N, the probability of using these poor estimators grows 
for awhile before it shrinks, and hence the variance of @ as a whole follows this same 
surprising sinuous pattern. 


DONALD CG. MARTIN and 8. K. KATTI (Florida State University, Talla- 
767 hassee, Florida). Fitting Certain Contagious Distributions to Some of the 
Available Data by Maximum Likelihood Method. 


The sample distributions obtained by Beal, [1940], Hcology 21, Bliss and Fisher, 
[1953], Biometrics 9, and McGuire et al., {1957}, Biometrics, 13 have been frequently 
employed to test the fit of many theoretical distributions. The distributions that 
have been found to have large enough regions of applicability are the Neyman Type 
A, Negative Binomial, Poisson Binomial, and the Inflated Poisson. All of these 
require numerical methods or tables to estimate the parameters by maximum likeli- 
hood. Specifically the problem of estimating parameters in the Neyman Type A 
and the Poisson Binomial is considerably involved. This has resulted in comparing 
of the newly formulated distributions with the inefficient fits, e.g. moment fits, 
of these distributions thereby confounding the superiority of the new distribution 
with the superiority of the method of estimation. The present authors have obtained 
maximum likelihood fits for most of the data that they feel are promising in studying 
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the problem of curve fitting by using an electronic computer. They would like to 
report that the values of the chi squares used for testing the goodness of fit do not 
indicate any consistent good fit by one of these distributions. 


M. R. SAMPFORD (A. R. C. Unit of Statistics, Aberdeen, Scotland). A 


sce Problem in Cluster Sampling with Replacement. 


When the units of a population fall naturally into clusters of unequal size, 
selection of clusters with probability proportional to size, with replacement of 
selected clusters, provides easily calculated and unbiased estimators of the popula- 
tion mean and the variance of its estimate. Sampling usually proceeds until a 
pre-determined number n of clusters (not necessarily all different) has been chosen: 
under this system some economy of resources 1s achieved (since the mean of a cluster 
selected twice need be determined only once), at the cost of some increase in variance, 
possibly large when sampling is from a small population or stratum. 

If resources to determine n cluster means are available, a sample containing 
n distinct clusters, and so providing a more precise estimate, might be preferred. 
If the usual method of sampling with replacement is continued until the (n + 1)th 
distinct unit is chosen for the first time, at the (r + 1)th drawing, the sample con- 
sisting of the first 7 cluster means (corresponding to the first n distinct clusters 
chosen) provides unbiased estimators when analyzed as though r (rather than ) 
had been pre-determined. 

A small modification provides an unbaised estimator of the sampling variance 
when clusters are sub-sampled. 


MARVIN A. SCHNEIDERMAN and PETER ARMITAGE (National 
Institutes of Health, and London School of Hygiene and Tropical Medicine, 
Bethesda, Md. and London, England). A Family of Truncated Sequential 
Plans. 


769 


For the normal deviate variance known, a family of truncated sequential plans 
(called “wedge” plans) with outer boundaries identical with those of Wald, Sobel- 
Wald, and Armitage have been developed. These plans have known, fixed Type I 
and Type II error and constitute a general class, of which the Wald (open) schemes 
are one extreme special case and the Armitage (restricted) schemes with a vertical 
middle boundary, the other extreme special case. 

Plans have been developed for both the one-sided (two-decision), and for the 
two-sided (three-decision) case. Monte Carlo trials comparing “‘wedge’’ schemes 
with equivalent open schemes (Wald and Sobel-Wald) show somewhat increased 
average sample sizes for the wedge schemes in the vicinity Hy , reduced sample 
sizes for values of the parameter, 6, between Hp and H, (6; > 60), and equivalent 
sample sizes for 6 > H,. The variance of the ASN appears smaller for the wedge 
schemes at all values of the parameter. Tables of coordinates of the wedge for nine 
common combinations of a and B have been computed, and will be published. 


R. J. TAYLOR and H. A. DAVID (Dept. of Statistics, Virginia Polytechnic 
770 Institute, Blacksburg, Va.). Sequential Allocation of Patients in Clinical 
Trials. 


This paper describes a scheme for sequentially altering the proportion of patients 
assigned to the various treatments of a clinical trial according to the results obtained 
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as the trial proceeds. Superior treatments are, by this means, allocated a higher 
proportion of patients than inferior ones. This alternation of proportion is performed 
by use of a weighting function, several alternative forms of which are given. A 
simulation study of the efficacy of the procedure with regard to its ability to select 
correctly the best treatment is described and the results presented. These results 
indicate that, with the use of appropriate weighting functions, this procedure is 
better able to select the best treatment than an equal allocation trial using the same 
number of patients. This comparison has been made on the basis of Sobel and 
Huyett’s (1957) study of the equal allocation case. The study shows that the 
weighting functions which are most efficacious in correctly selecting the best treat- 
ment are the ones that tend to assign the largest proportion of patients to the best 
treatment. 
A theoretical study of these statistics in special situations is also discussed. 


L. H. WADELL (Department of Animal Husbandry, Cornell University, 
771 Ithaca, N. Y.). Selection Bias in Intraclass Correlation Repeatability Esti- 
mates. 


Research workers in the field of quantitative genetics use functions of variance 
components as estimates of parameters needed in the design and application of 
selection programs. The estimates of these parameters generally have to come from 
selected data. This paper demonstrates the bias that is introduced into the intra- 
class correlation when computed from a one-way classification analysis with unequal 
subclass numbers where the unequal subclass numbers are caused by systematic 
truncation culling. Empirical sampling results are given to demonstrate this bias 
which varies with culling intensity and with the size of the true intraclass correlation. 
The bias introduced by increasing the culling intensity is greater for a low true 
repeatability than for a high true repeatability. A correction technique is given 
for this analysis which eliminates this bias. The accuracy of this method is supported 
by empirical sampling results. 


R. M. ZAKI, B. B. BHATTACHARYYA and R. L. ANDERSON (North 
772 Carolina State College, Raleigh, N. C.). Ona Problem of Production Planning 
Over Time. 


This paper is concerned with the decision problem faced by a firm which produces 
a nonstorable commodity and has to spend large amounts of capital on a specialized 
factor that could be idled part of the time by fluctuations in production. In partic- 
ular, it is assumed that the firm uses f resources, N, units of the rth resource being 
available for production. These resources are to be allocated to one or more of 7’ 
different time periods, n,: units of the rth resource to the tth period. Each of these 
nyt units can produce y+ units of output at a price less direct variable costs of pre . 
The cost of the specialized factor for being available in any of the 7’ time periods is 
approximated by a constant multiple of the maximum production in any one time 
period. The decision problem is to determine the allocation plan {n,:} which 


maximizes 
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subject to the restrictions 
T 
ens SON pe 9 ee 
t=1 


A linear programming formulation of the problem is given. A procedure is 
developed for finding the optimal solutions to the problem for each value of ¢ in 
the interval c > 0. Explicit optimal solutions are given for a simplified model 
with f = 1. 


The following are abstracts of papers presented to W.N.A.R. at the 
University of Washington, Seattle, Wash., June 14-17, 1961. 


WALTER A. BECKER and LAWRENCE R. BERG (Washington State 
773 University, Pullman and Puyallup, Washington). Factors Affecting the 
Sensitivity of Growth Experiments. 


A series of chick growth experiments utilizing various genotypes and diets 
were performed to investigate the influence of different factors upon the sensitivity 
of experiments. F statistics, M.S. among treatments/M.S. among individuals, 
within treatments in one-lay layouts, were used to measure sensitivity. All body 
weight data were transformed to logarithms. 

In experiments that determined differences among diets, the within treatment 
variance increased as the animals grew older, reached a plateau, and then declined. 
The F value acted in a similar manner. The within treatment variance was greater 
for birds fed sub-optimal diets than for those fed optimal diets. When determining 
differences among strains, the highest F values occurred when birds were given the 
optimal diets. In nutritional research the “best’’ animals, in terms of producing 
the most sensitive experiment were those with highest nutritional requirements. 


NEETI R. BOHIDAR (lowa State University, Ames, Iowa). Monte Carlo 


ue Investigations of the Effect of Linkage on Selection. 


A Monte Carlo investigation was undertaken to study the effect of linkage on 
the efficiency of selection. A program for the ‘“Cyclone’”’, the high speed digital 
computer located at lowa State University, was written to accommodate any combi- 
nation of the following facets: type of initial population, dominance, epistacy, linkage 
relations, selection intensity and some type of selection. The biological parameters 
involved in the actual numerical work were as follows: two types of population, 
repulsion and coupling; four types of dominance, no dominance, complete dominance 
over dominance, and mixed dominance; three types of character, character expressed 
by males, females and both sexes; three types of truncations, upper extreme, inter- 
mediate and lower extreme; two types of selection intensities, 20/40 and 5/40, 
and nine types of linkage relations, .5, .3, .1, .03, .015, .007, .003, t; and t, where 
t, stands for tight linkage in female and ¢,, stands for tight linkage in male. Graphical 
method of representation was resorted to, to provide a clear picture of the situation. 
The results gave definite indications of the roles of these factors on the effects of 
selection and offered a comparative study of the effects of the combination of dif- 
ferent facets of interest on the efficiency of selection. 
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H. D. BRUNK (University of Missouri, Columbia, Mo.). A Statistic’Related 


775 
to Kolmogorov’s. 


Let F, denote the empiric distribution function of a random sample of size n 
from a population. In connection with distributions on a circle, Kuiper introduced 
the statistic max, [F(x) — Fn(z)] — minz [F(z) — F,(x)| Undag. Math.-Proc. Kon. 
Nederlandse Akad. Wet., Ser. A, 63 (1960) 38-47], for testing the hypothesis that 
the population has distribution function F. The statistic studied here is essentially 
Kuiper’s: it bears the same relationship to Kuiper’s as does Pyke’s [Ann. Math. 
Stat., Vol. 30 (1959), pp. 568-576] modification of Kolmogorov’s to Kolmogorov’s 
itself. The statistic occupies a position intermediate between Kolmogorov’s [Inst. 
Ital. Attuari, Giorn., 4 (1933) 1-11] and Sherman’s [Ann. Math. Stat., 21 (1950) 
339-361], and the corresponding test appears more powerful than Kolmogorov’s 
against certain alternatives (e.g. different scale parameter, for a symmetric distribu- 
tion), and less powerful against others (e.g. different location parameter). Asymptot- 
ically the statistic coincides with that of Kuiper, who gives the asymptotic dis- 
tribution (loc. cit.). A theorem of Sparre Andersen [Skand. Aktuarietidskrift, 36 
(1953) 123-138] makes possible an essential simplification of the problem of deter- 
mining the distribution for finite sample size. After this simplification, methods 
developed for Kolmogorov’s statistic by Kolmogorov, Feller, Dempster and others 
can be used. Tables are in preparation. 


JAMES L. LEITCH (Laboratory of Nuclear Medicine and Radiation Biology, 
776 School of Mecidine, University of California, Los Angeles, Cal.). Radiation 
Effects and Their Statistical Evaluation: Introduction. 


With the ever-increasing interest in the biological effects of ionizing radiation, 
a review of the various facets in this field is considered as timely. It is necessary 
that all parameters, influenced by (1) radiation characteristics, (2) biological char- 
acteristics, (3) pre-irradiation conditions, (4) post-irradiation conditions, (5) treat- 
ment (protective) factors, and (6) criteria for evaluation of the biological effects, 
must be considered in any statistical appraisal. Initially these factors will be 
presented in outline form and discussed in more detail in subsequent papers. 


JAMES L. LEITCH (Department of Nuclear Medicine and Radiation Biology, 
777. School of Medicine, University of California, Los Angeles, Cal.). Biological 
Factors in Radiation Effects. 


A general review of literature data will be presented on the relationship between 
various biological factors and the radiation syndrome. Special emphasis will be 
placed on the treatment (protective) factors which may modify the basic syndrome. 
New experimental data involving X-ray effects on mice will be presented relative 
to the following parameters: (a) biological variability, (b) a possible seasonal effect, 
(c) cage effect and (d) interrelationship between dose rate and protection. An 


initial approach to the statistical evaluation of radiation experiments will be discussed. 


; FRANK J. MASSEY, Jr. and CARL E. HOPKINS (School of Public Health, 
778 University of California, Los Angeles, Cal.). Tables of Exact Sampling 


Distribution of R2. 


The exact sampling distribution of the multiple correlation coefficient A? has 
been computed on the IBM 709 of Western Data Processing Center at UCLA and 
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tabled for various magnitudes of the parameter, sample size, and number of variables. 
The density function and the cumulative distribution function are given in intervals 
of 0.01 in R?, the sample coefficient. 

These distributions should be useful in applications requiring significant tests 
and confidence intervals for sample R?’s where the hypothesis is non-zero, and for 
determining the power and sample size requirements of projected studies, such as 
epidemiologic surveys, in which the population R? is expected to be non-zero. 


ORSELL M. MEREDITH (Laboratory of Nuclear Medicine and Radiation 
Biology, School of Medicine, University of California, Los Angeles, Cal.). 
Comparison of Dose-Rate Effects on CF-1 Mouse: Mortality Between 250 
KVP X-Rays and Cobalt-60 Gamma-Rays. 


779 


A comparison of acute radiation mortality response with dose rates ranging 
from 2-170 r/min has been performed with CF, female mice. Analyses of response 
have been based upon methods of probit analysis elaborated by Finney. A rapid 
increase in the LD50(30) level was observed for either Co® y-rays or 250 KVP X-rays 
as the exposure dose rate was reduced below 20 r/min. On the other hand, little 
change in LD5o (30) was observed with increase of exposure dose rate above 20 r/min. 
For either type of radiation source there was no significant deviation from parallelism 
when all of the probit dose response curves were considered. In addition, study 
has been made of the comparative applicability to Co y-rays and 250 KVP X-rays 
results of various mathematical models which have been proposed for radiation 
mortality response. 


STANLEY R. PERSON (Laboratory of Nuclear Medicine and Radiation 
Biology, School of Medicine, University of California, Los Angeles, Cal.). 
Relationship Between Physical Characteristics of Radiation and its Biological 
Effect. 


780 


Physical factors affecting the radiation sensitivity of whole animals will be 
discussed. Factors to be discussed center around changes in the radiation sensitivity 
breught about by use of radiation of different qualities. Data from the literature 
will be presented on the RBE of radiations that give rise to different rates of energy 
loss. A discussion of current dosimetry methods and factors affecting accurate 
dosimetry will be given. 


A. D. WIGGINS (General Electric Company, Richland, Wash.). Further 


781 A 
Aspects of a Multicompartment Migration Model. 


The present paper represents an effort to extend the results of an earlier paper 
[Wiggins (1960). On a Multicompartment Migration Model With Chronic Feeding. 
Biometrics 16:4, 642-58] in several directions. First, the earlier model is generalized 
to include the possibility of an independent source or “feeding function” within each 
compartment. Second, a result of S. Bernstein [P. Lévy (1948). Processus Stochas- 
tiques et Mowvement Brownien. Gauthier-Villars, Paris. p. 64], namely the derivation 
of the one-dimensional diffusion equation of probability theory starting from a 
stochastic differential equation, is extended to K dimensions. The K-dimensional 
diffusion equation corresponding to the present migration model is then derived 
and an attempt is made to solve the equation in two dimensions. 
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Two numerical examples resulting from the application of the estimation 
procedure of the earlier Biometrics paper to experimental data are presented. The 
resulting graphs are plotted and compared with a plot of the experimental points. 


CORRECTIONS 


J. A. Nelder (1961]. The Fitting of A Generalization of the Logistic Curve. Biometrics 
17, 89-110. 


In the above paper the following reference on page 110 was omitted: 

Skellam, J. G., Brian, M. V., and Proctor, J. R. [1959]. The simultaneous growth 
of interacting systems. Acta Biotheoretica 13, 131-144. 

Also the algebraic expression in the heading of Table 1, (p. 91), should read 


[1/ + e7)). 


R. C. Elston [1961]. On Additivity in the Analysis of Variance. Biometrics 17, 
209-19. 


The fourteenth line on page 215 should read: “null hypothesis, and, if condition 
(5) holds, provides an approximate.” 


THE BIOMETRIC SOCIETY 


Brazilian Region 


REPORT ON THE MEETING OF THE BRAZILIAN REGION 


On March 29, 1961, the Brazilian Region of the Biometric Society held its 
annual meeting at the Department of Statistics of the Faculty of Hygiene and 
Public Health of the University of Sio Paulo, SAo Paulo, Brazil. 

The first part of the meeting was devoted to the presentation of the following 
scientific papers: 


“Calculo da disténcia morfolética em 4 grupos de Laelia Sp.” by Rolando 
Vencovsky. 

“Relagdo entre a Andlise Tradicional de Experimentos Fatoriais de Adubagio 
de 3% e a Superficie de Resposta Respectiva’”’ by F. Pimentel Gomes. 

“Andlise de um experimento sdébre aplicacdo de micro-elementos em cafeeiros” 
by H. Vaz de Arruda. 

“Correction for bias introduced by a transformation of variables” by Jerzy 
Neyman, as a visitor at the University of Sdo Paulo. 

The second part was devoted to the annual report for 1960 and the election 
of 1961 regional officers. The report and accounting demonstration for 1960 were 
submitted to the attendant members and approved. 

In accordance with the results of the election, the names of new officers submitted 
for the approval of the Council of the Society are: 


President—Adolpho M. Penha 
Treasurer—Americo Groszman 
Secretary—Elza Berqué 
Council Members: 
Pompeu Meméria 
José T. A. Gurgel 
Frederico Pimentel Gomes 
Ruy Aguiar da Silva Leme 
Frederico G. Brieger 
Geraldo Gracia Duarte. 


British Region 


At a meeting held on April 18th, 1961, the following papers were read and 
discussed: 

G. Harrington: Studies of Visual Judgments of Quality in Bacon. 

J. M. Tanner and M. J. R. Healy: Assessment of Maturity from X-rays of 
the Wrist and Hand. 
W.N.A.R. 

ANNUAL MEETING 

The annual meeting of the WNAR Biometric Society was held at the University 

of Washington, Seattle, Washington, on June 14, 15, and 16, 1961, in conjunction 
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with meetings of the Institute of Mathematical Statistics, the Section on Physical 
and Engineering Sciences of the American Statistical Association, the American 
Mathematical Society, and the Institute of Management Sciences; and a special 
“Symposium on Convexity” sponsored by the American Mathematical Society. 


Program 


Wednesday, June 14, 1961 
7:30-10:00 p.m.—Informal Inferential Procedures (IMS, ASA-SPES and WNAR) 

Chairman: A. M. Mood, C.E.I.R., Inc., Los Angeles, California 

1. “The Future of Data Analysis.” J. W. Tukey, Princeton University and 
Bell Telephone Laboratories, Murray Hill, New Jersey. 

2. “Some Sequences of Fractional Replicates.”’ C. Daniel, New York. 

3. “Graphical Methods for Internal Comparisons in Multi-response Experi- 
ments.” M. B. Wilk and R. Gnanadesikan, Bell Telephone Laboratories, 
Murray Hill, New Jersey. 


Thursday, June 15, 1961 


8:30-10:00 a.m.—Stochastic Processes in Biology 
Chairman: A. T. Bharucha-Reid, University of Oregon 
1. “Further Aspects of a Multi-Compartment Migration Model.” A. D. 
Wiggins, General Electric Company. 
2. “A Statistic Related to Kolmogorov’s.” H. D. Brunk, University of Missouri. 


10:30 a.m.—12:30 p.m.—Planning and Analysis of Experiments (ASA-SPES and 

WNAR) 

1. “The Consideration of Variance and Bias Errors in the Selection of a Response 
Surface Design.” G. E. P. Box, University of Wisconsin, and N. R. Draper, 
Mathematics Research Center, U. 8S. Army. 

2. “Orthogonal Main-Effect Plans.” Sidney Addelman, Research Triangle 
Institute. 

3. “Asymmetric Factorial Designs and the Direct Product.” B. Kurkjian, 
Diamond Ordnance Fuze Laboratories, and M. Zelen, University of Mary- 
land and National Bureau of Standards. 


2:15-3:45 p.m.—Radiation Effects and their Statistical Evaluation 

Chairman: James L. Leitch, Laboratory of Nuclear Medicine and Radiation 
Biology, U.C.L.A. 

1. Introduction 

2. “Relationship between Radiation Characteristics and Radiation Effects.” 
S. R. Person, Laboratory of Nuclear Medicine and Radiation Biology, 
U.C.L.A. 

3. “Biological Factors Involved in Radiation Effects.” James L. Leitch. 

4. “Comparison of Dose-Rate Effects on CF-1 Mouse Mortality Between 250 
KVP X-Rays and Cobalt-60 Gamma Rays.” Orsell M. Meredith, Laboratory 
of Nuclear Medicine and Radiation Biology, U.C.L.A. 


Friday, June 16, 1961 


8:30-10:30 a.m.—Estimation (IMS and WNAR) 
1. “Combining Information in Incomplete Blocks.” F. A. Graybill, Colorado 
State University. 
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to 


_ “Estimation with Minimum Mean Square Error.” H. O. Hartley, Iowa 
State University. 

3. “Remarks on the Efficiency of Unbiased Estimation with Auxiliary Variates.”’ 

W. H. Williams, Bell Telephone Laboratories, Murray Hill, New Jersey. 


1:00-2:30 p.m.—Contributed Papers 


Chairman: A. D. Wiggins, General Electric Company 

1. “Monte Carlo Investigation of the Effect of Linkage on Selection.” N. R. 
Bohidar, Utah State University. 

2. “Tables of Exact Sampling Distribution of R?.” F. J. Massey, Jr. and Carl 
E. Hopkins, U.C.L.A. 

3. “Various Levels of Riboflavin and the Sensitivity of Experiments on Growth.” 
W. A. Becker and L. R. Berg, Washington State University. 


CHANGES IN MEMBERSHIP 
(January 15—July 15, 1961) 
Changes of Address 


Mr. Ross W. Adams, 1251 Hawthorne, Ames, Iowa, U.S.A. 

Mr. B. L. Adkins, Statistics Department, University of New England, Armidale, 
N.S.W., Australia. 

Miss Margaret F. Allen, School of Aviation Medicine, USAF, Brooks AFB, Texas, 
U.S.A. 

Mr. H. A. J. Amand, 21 Place Cardinal Mercier, Rizensart, Belgium. 

Mr. Donald W. Bailey, Cancer Research Institute, Univ. of California Medical 
Center, San Francisco 22, California, U. S. A. 

Mr. B. O. Bartlett, Agricultural Research Council, Letcombe Regis, Wantage, 
Berkshire, England. 

Dr. Glenn E. Bartsch, Department of Preventive Medicine, Western Reserve 
University, Cleveland 6, Ohio, U.S. A. 

Mrs. Hannelore Beyer, Haertelstr. 16-18, Leipzig C 1, Germany. 

Mr. Paul Blunk, 4616 Plantation Drive, Fair Oaks, California, U. 8. A. 

Mr. W. F. Bodmer, Department of Genetics, University of Cambridge, Cambridge, 
England. 

Mr. Roger L. Bollenbacher, 860 Hiawatha Drive, Elkhart, Indiana, U.S. A. 

M. Jacques Bredas, 24 rue Grand Bry, Montiguy-Le-Tilleul, Belgium. 

Dr. Leroy 8. Brenna, The Texas Company, 12th Floor Chrysler Bldg., New York, 
Is Vaas Wis es ak 

Dr. A. Brown, Department of Mathematics, Australian National University, 
Canberra City, A.C.T., Australia. 

Dr. Robert V. Brown, Box 181, Edgewood, Maryland, U.S. A. 

Dr. W. R. Buckland, The Exonomist Intelligence Unit, St. James, London, S8.W. 1, 
England. 

Mr. A. Burny, 77 avenue des Combattants, Gembloux, Belgium. 

Mr. Lyle D. Calvin, Department of Statistics, Oregon State University, Corvallis, 
Oregon, U.S. A. 

Dr. A. H. Carter, 1 Hackin Place, Fairfield, Hamilton, New Zealand. 

Mr. Melvin W. Carter, Department of Mathematics and Statistics, Purdue Uni- 
versity, Lafayette, Indiana, U. 8. A. 

Mr. David B. Christian, 33 Crestview Drive, Whitesboro, New York, U.S. A. 
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Mr. Frank B. Cramer, 17331 Tribune Street, Granada Hills, California, U.S. A. 

Mr. Mare Dalebroux, c/o Instituto di Genetica, Universita di Pavia, Pavia, Italy. 

Dr. James G. Dare, Department of Pharmacy, University of Queensland, Brisbane, 
Australia. 

Dr. Richard J. Daum, 3900 Hamilton Street, Hyattsville, Maryland, U.S. A. 

Miss M. E. Davis, Department of Agriculture, Box 1500, Wellington, New Zealand. 

Mr. R. De Coene, 103 rue Edith Cavell, Bruxelles 18, Belgium. 

M. Jean Dejardin, ORSTOM, 24 rue Bayard, Paris (VIII®) France. 

Mr. R. Delhaye, 36 avenue Jean Van Raclen, Bruxelles 16, Belgium. 

Dr. Daniel B. De Lury, Department of Mathematics, University of Toronto, 
Toronto 5, Canada. 

Mr. A. Deville, 2 rue Middelbourg, Boitsfort, Belgium. 

Mr. H. M. Dicks, Department of Agriculture, J. S. Marais Bldg., Stellenbosch, 
South Africa. 

M. Pol Dineur, Golsinnes, Bossiere par Masy, Belgium. 

Miss Irene L. Doto, Communicable Disease Center, 2082 West 38th Street, Kansas 
City 3, Kansas, U.S. A. 

Dr. D. B. Duncan, Department of Biostatistics, The John Hopkins University, 
Baltimore 5, Maryland, U.S. A. 

Mr. Steve A. Eberhart, Department of Agronomy, Iowa State University, Ames, 
Iowa, U.S. A. 

Dr. F. Ectors, 111 rue du Centre, Assesse, Belgium. 

Mrs. Polly Feigl, Olof Skotkonungsgatan 66, Goteborg 8. Sweden. 

Dr. Heinz Fink, Morgengraben 14, Koeln-Stammheim, Germany. 

Mr. Robert Fitzpatrick, 5229 21st Avenue, N.E., Seattle 5, Washington, U. 5. A. 

Dr. Henry R. Fortmann, Agricultural Experiment Station, Pennsylvania State 
University, University Park, Pennsylvania, U.S. A. 

Mr. Robert A. Harte, Am. Soe. of Biological Chemists, 9650 Wisconsin Avenue, 
Washington 14, D. C., U.S. A. 

Prof. Dr. Jo Hartung, Ruehlmannstr. 8, Hannover, Germany. 

Dr. Don W. Hayne, Patuxent Wildlife Research Center, Laurel, Michigan, U.S. A. 

Prof. Dr. J. Hemelrijk, Keizer Karelweg 83, Amstelveen (N.H.), Netherlands. 

Mr. Jean Henry, 1 rue Defacqs, Bruxelles, Belgium. 

Dr. Paul G. Homeyer, C-E-I-R, Inc., 11753 Wilshire Blvd., Los Angeles 25, California, 
Wi Saas 

Dr. Carl E. Hopkins, School of Public Health, University of California, Los Angeles 
24, California, U.S. A. 

Mr. Paul V. Hurt, Deerfield, Wisconsin, U. 8. A. 

Dr. Peter Ihm, ¢/o Euratom, Casella postale 191, Como, Italy. 

Mr. Arthur G. Itkin, 1870 Clayton Road, Abington, Pennsylvania, U. 5. A. 

Mr. Willaim G. H. Ives, Forest Biology Laboratory, Box 6300, Winnipeg, Manitoba, 


Canada. 
Dr. Dubodh K. Jain, Botany Division, Indian Agricultural Research Institute, 


New Delhi 12, India. 

Mr. Eugene A. Johnson, Industrial Engineering, University of Minnesota, 
Minneapolis, Minnesota, U.S. A. 

Dr. med. Herbert Jordan, Haus Tusculum, Bad Elster, Germany. 

Prof. Dr. Hans Kelleher, Ludwigstr. 18, Munchen, Germany. 

Mr. Thomas R. Konsler, Mt. Hort. Crops Research Station, Route 2, Fletcher, 
North Carolina, U. 8. A. 

Dr. Paul Kuehne, Nachodstr. 19, Berlin W 30, Germany. 
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Mr. Thomas E. Kurtz, Mathematics Department, Dartmouth College, Hanover, 
New Hampshire, U. 5. A. 

Mrs. Katherine B. Ladd, 4 Woodside Drive, South Burlington, Vermont, U.S. A. 

Dr. R. J. Ladd, Physiology Department, University of Queensland, Brisbane, 
Australia. 

Dr. G. E. J. Lambelin, 231 Chaussee d’Alsemberg, Bruxelles 18, Belgium. 

Dr. Lonnie L. Lasman, A. J. Wood Research Corporation, 42 South 15th Street, 
Philadelphia 2, Pennsylvania, U.S. A. 

Mr. Jay P. Leary, Jr., 1253 S. Longwood Avenue, Los Angeles 19, California, U.S. A. 

M. Jerome Lejeune, Institut de Progenese, 15 rue de VEcole Medecine, Paris (VI°), 
France. 

Mr. Robert Lichter, Ragis-Stat. Heidehof, Brockhoefe, Kr. Velzen, Germany. 

Dr. D. Lindley, Department of Statistics, University College of Wales, Aberystwyth, 
Cards, Wales. 

Mr. George F. Lunger, P. O. Box 583, Camden, New Jersey, U.S. A. 

Marcel J. W. Luttgens, 7 rue Van Oost, Brussels 3, Belgium. 

Prof. Dr. A. Maede, Grasse Steinstr. 81 Halle/Saale, Germany. 

Mr. James E. Mangan, 7443 N. Claremont Avenue, Chicago 45, Illinois, U. S. A. 

Mr. Stuart H. Mann, 511 W. University, Champaign, Illinois, We 5 ke 

Mr. Robert Marechal, J. M., 29 rue Docqgs, Gembloux, Belgium. 

Dr. M. Maricz, 16 avenue des Abeilles, Ixelles, Belgium. 

Mr. T. J. Marynen, 11 ring Laan, Berchem-Anvers, Belgium. 

Mr. John W. Mayne, Operational Research Group, Defence Research Board, 
Ottawa, Canada. 

Mr. Judson U. McGuire, European Parasite Laboratory, 20 bis rue Sadi Carnot, 
Nanterre (Seine) France. 

Mr. Martin Menzi, Kreuzrain, Hedingen (ZH) Switzerland. 

M. Philippe Merat, 20 rue de Louvre, Viroflay (S. and O.) France. 

Mr. Donald L. Meyer, 1646 S. State Street, Syracuse 5, New York, U.S. A. 

Dr. A. M. Mood, C-E-I-R, Inc., 11753 Wilshire Blvd., West Los Angeles, California, 
Waits ae 

Prof. Sigeiti Moriguti, Department of Statistics, Stanford University, Stanford, 
California, U.S. A. 

Dr. M. B. Mueller, Plastics Division, Allied Chemical Corporation, Glenolden, 
Pennsylvania, U.S. A. 

Dr. Hugo Muench, Jr., 100 Memorial Drive, Cambridge 42, Massachusetts, U.S.A. 

Dr. Karl Heinz Muller, F. Schelling Str. 3, Jena, Germany. 

Mr. August Carl Nelson, Jr., 1015 Green Street, Durham, North Carolina, U.S. A. 

Dr. A. R. G. Owen, Department of Genetics, University of Cambridge, Cambridge, 
England. 

Dr. Erich Panse, c/o Lochow-Pettkus CmbH, Bergen Krs. Celle, Germany. 

Dr. Benjamin Pasamanick, Columbia Psychiatric Institute and Hospital, Columbus 
10, Ohio, U.S. A. 

Dr. Mary Ellen Patno, 2451 Yost Boulevard, Ann Arbor, Michigan, U.S. A. 

Mr R. Pierlot, 5 avenue des Phalenes, Bruxelles 5, Belgium. 

M. Jacques Poly, Service de genetique animale, 16 rue de |’Estrapede, Paris (V°) 
France. 

Mr. Joe Powell, 8245 Park Place Blvd., Apt. 6, Houston 17, Texas, U.S. A. 

Mr. Wolf Prensky, Department of Biology, Brookhaven National Laboratory, 
Upton, L.I., New York, U.S. A. 
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Mr. Lester W. Preston, Jr., A. H. Robins Co., Inc., 1407 Cummings Drive, Richmond 

20, Virginia, U.S. A. 

Mr. Dieter Rasch, Freiligrathstr. 14, Rostock, Germany. 

Dr. Arthur Ringoet, 1 avenue du Congo, Bruxelles 5, Belgium. 

Mr. Erwin Roth, Jaminstr. 30, Erlangen, Germany. 

Dr. O. K. Sagen, 210 Fifth Street, S.W., Washington, D. C., U.S. A. 

Mr. Wilfred Salhuana, Universidad Agraria, Apartado 456, La Molina, Lima, Peru. 

Dr. Hellmut Schmalz, Berliner Str. 2, Hohenthurm-Saalkreis, bei Haale/Saale, 
Germany. 

Cand. Math. Berthold Schneider, Marburger Str. 18, Giessen, Germany. 

Dr. Francesco Sella, Instituto di Genetica, Via S. Epifanio 14, Pavia, Italy. 

Mr. William Seyffert, MPI f. Zuechtungsforsch, Post Bickendorf, Koeln-Vogelsang, 
Germany. 

Dr. Robert R. Shrode, 111 E. State Street, Sycamore, Illinois, U. S. A. 

Dr. Donald F. Starr, Route 1, Box 321 A, Grand Island, Nebraska, U. 8. A. 

Mr. Otto Steiner, St. Wenderlstr. 50, Braunschweig, Germany. 

Mr. N.S. Stenhouse, Division of Math. Statistics, C.S.LR.O., University of Adelaide, 
Adelaide, S. Australia. 

Dr. Klaus J. Stern, Manhagener Allee 84, Schmalenbeck/Ahrensburg, (Holst.) 
Germany. 

Miss Elizabeth Street, 231 West 13th Street, New Word il, ING Wh Wet AG 

Mr. R. C. Tomlinson, 74 Gayton Road, Harrow, Middlesex, England. 

Dr. M. Torfs, 99Bd Lambermont, Bruxelles 3, Belgium. 

Mr. Gerard Torreele, 5 Slachthuesstraat, Niouwpoort, Belgium. 

Dr. Jean Vacher, 22 avenue Grammont, Tours (Indre-et-Loire) France. 

M. Raymond Van Den Driessche, 42 rue du Friquet, Bruxelles 17, Belgium. 

Mr. A. Van Parijs, 128 rue de la Loi, Bruxelles 4, Belgium. 

Mr. Thierry Waffelaert, 5 rue J. B. Verlooy, Anvers, Belgium. 

Prof. Dr. Erna Weber, Schenkestr. 8c, Berlin-Karlshorst, Germany. 

Mr. Irving Weiss, The Mitre Corporation, Bedford, Massachusetts, U.S. A. 

Mr. Robert White, Box. 241, Dugway, Utah, U.S. A. 

Mr. Henry K. C. Woo, E-TAI Ltd., 95 Liberty Street, New York 6, IN May WE Iss AN 

Dr. Gunter Wricke, ueber Lingen/Ems, Klausheide, Germany. 


New Members 


At Large 


Dr. Martin Eugene Dehousse, University of Ruanda-Urundi, B.P. 1550, Usumbura- 


Burundi, Africa. 
Mr. Hong Suk Lee, Last Crop Section, Agricultural Experiment Station, Suwon, 


Korea. 
Mr. Heliodoro Miranda M, Inter-American Institute of Agricultural Sciences, 


Turrialba, Costa Rica. 
Ing. Luis A. Montoya-Armas, Instituto Interamericano de Ciencias Agricolas, 


Turrialba, Costa Rica. 


Australia 
Prof. John Henry Bennett, Department of Genetics, University of Adelaide, 


Adelaide, South Australia. 
Dr. B. Diamantis, 2 Raleigh Street, Windsor 8. 1, Victoria, Australia. 
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Mr. Alan E. Stark, Div. of Fisheries and Oceanography, C.S.I.R.O., P. O. Box 21, 
Cronulla, N.S.W., Australia. 


British 

Dr. P. F. D’Arcy, Pharmacology Department, Allen and Hanburys Ltd, Ware, 
Herts., England. 

Mr. J. P. Evenson, Wellcome Research Laboratories, Langley Court, Beckenham, 
Kent, England. 

Mr. P. Hallam, 67 Tomline Road, Felixstowe, Suffolk, England. 

Mr. D. J. Harberd, Plant Breeding Station, Pentlandfield, Roslin, Midlothian, 
England. 

Mr. P. Holgate, “High Canons’’, Well End, Barnet, Herts., England. 

Mr. G. J. Knight, Wellcome Research Laboratories, Langley Court, Beckenham, 
Kent, England. 

Mr. Ian McDonald, 6 Deeside Place, Aberdeen, Scotland. 

Mr. F. M. O’Carroll, 12 Dundela Avenue, Sandycove, Co. Dublin, Ireland. 

Prof. W. T. Williams, Botany Department, The University, Southampton, England. 


Belgian 


Mr. Joseph J. Gabriel, 54 avenue Dr Decroly, Uccle-Bruxelles 18, Belgium. 

Mr. Georges Geortay, 26/60 Avenue Georges Truffaut, Liege, Belgium. 

Dr. L. Goeminne, 98 Chaussee de Gand, Deinze, Belgium. 

Dr. Paul Janssen, Department de Recherches des Laboratories Pharmaceutique, 
Turnhout, Belgium. 

Dr. A. Jeurissen, Sanatorium, Buizingen, Belgium. 

Mr. Francois R. Martin, 5 Rue Caroly, Ixelles, Bruxelles, Belgium. 

Mr. Andre Pieteres, Brusselse steenweg 407, Gentbrugge, Belgium. 

Mr. Francois Sterckx, Yangambi I, B.P. 1035, Stanleyville, Belgian Congo. 

Dr. Robert Van Vaerenbergh, 6 avenue du Saleil, Knokke, Belgium. 


ENAR 

Dr. Helen Abbey, 615 N. Wolfe Street, Baltimore 5, Maryland, U.S. A. 

Dr. Elliott T. Adams, P. O. Box 47, Upham’s Corner Station, Boston 25, Massa- 
chusetts, U. 8. A. 

Dr. Daniel J. Baer, 2877 Valentine Avenue, New York 58, N. Y., U.S. A. 

Dr. Harle V. Barrett, Department of Preventive Medicine, The Creighton Univer- 
sity School of Medicine, Omaha 2, Nebraska, U.S. A. 

Dr. A. F. Bartholomay, 12 Upland Road, Wellesley, Massachusetts, U. S. A. 

Miss Virginia B. Berry, Department of Mathematics, University of British Columbia, 
Vancouver B.C., Canada. 

Mr. Paul V. Blair, Populations Genetics Institute, Purdue University, Lafayette, 
Indiana, U.S. A. 

Dr. John R. Braunstein, 2123 Luray Avenue, Cincinnati 6, Ohio, U.S. A. 

Mr. Franklin W. Briese, 7500 Olivers Avenue South, Minneapolis 23, Minnesota, 
Uses Ae 

Mr. Paul M. Cohen, Technical Operations Inc., Box 37, Fort Monroe, Virginia, 
UP SA% 

Mrs. Elizabeth F. Davis, Hazleton Laboratories Inc., Box 30, Biometrical Unit, 
Falls Church, Virginia, U.S. A. 
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Dr. Roscoe A. Dykman, Division of Behavorial Sciences, University of Arkansas, 
Little Rock, Arkansas, U.S. A. 

Mr. John R. Flood, 94 Parkway Road, Bronxville, New York. U.S. A. 

Dr. D. H. Fogel, 1380 Bedford Street, Stamford, Connecticut, U.S. A. 

Dr. Seymour Geisser, Biometrics Branch, NIMH, Bethesda, Maryland, U.S. A. 

Dr. William T. Ham, Jr., Box 877, Medical College of Virginia, Richmond, Virginia, 
WS: Ac 

Mr. Herman B. Hamot, 835 Forbes Avenue, Perth Amboy, New Jersey, U.S. A. 

Dr. Dewey L. Harris, Statistical Laboratory, Iowa State University, Ames, Towa, 
U.S. A. 

Dr. Edwin Hendler, 415 Brentwood Road, Havertown, Pennsylvania, U. 8. A. 

Dr. Homer C. Jamison, 1919 Seventh Ave., 8., Birmingham 2, Alabama, U.S. A. 

Mr. Denis J. Kelleher, Department of Animal Husbandry, Iowa State University, 
Ames, Iowa, U.S. A. 

Dr. Samuel J. Kilpatrick, Statistical Laboratory, Iowa State University, Ames, 
Towa, U.S. A. 

Mr. Eugene Legler, Tennessee Game and Fish Commission, Cordell Hull Building, 
Nashville, Tennessee, U. 5. A. 

Dr. Guillermo Llanos-Bejarano, 608 N. Collington Avenue, Baltimore 5, Maryland, 
USS: A: 

Mrs. Ruth B. Loewenson, 4844 Xerxes Avenue, 5., Minneapolis 10, Minnesota, 
U.S. A. 

Dr. Josiah Macy, Jr., Department of Physiology, Albert Hinstein College of Medicine, 
New York 61, N. Y., U.S. A. 

Mr. Robert H. Miller, Dairy Herd Improvement, Agricultural Research Service, 
USDA, Washington 25, D. C., U.S. A. 

Dr. Richard Moore, American National Red Cross, 18th and E., N.W., Washington, 
(DRTC hs WU Shee 

Dr. Donald F. Morrison, Biometrics Branch, National Institutes of Health, Bethesda 
14, Maryland, U.S. A. 

Mrs. Sue W. Nealis, 6657-24th Place, Riggs Manor, Hyattsville, Maryland, U.S. A. 

Dr. Masatoshi Nei, Department of Genetics, North Carolina State College, Raleigh, 
North Carolina, U.S. A. 

Mr. Gill Nestel, 1219 S. State Street, Ann Arbor, Michigan, U.S. A. 

Mr. Marcelo M. Orense, Department of Experimental Statistics, North Carolina 
State College, Raleigh, North Carolina, U.S; A: 

Mr. James G. Osborne, Forest Service, USDA, South Building, Washington, D. C., 
We SwAS 

Dr. Bernard S. Pasternack, New York University Medical Center, 550 First Avenue, 
New York 16, N. Y., U.S. A. 

Dr. H. V. Pipberger, 7439 Little River Pike, Annandale, Virginia, U.S. A. 

Mr. P. V. Rao, Department of Mathematics, University of Georgia, Athens, Georgia, 
(Uy Sa Ae 

Mr. Searle B. Rees, 9 Strathmore Road, Brookline, Massachusetts, U. 8. A. 

Dr. Richard D. Remington, School of Public Health, University of Michigan, 
Ann Arbor, Michigan, U. 5. A. 

Mr. J. C. Richards, Jr., The Standard Oil Company, Midland Bldg., Cleveland 12, 
Ohio, U.S. A. 

Mr. Richard H. Richardson, 156 Williams Hall, N. C. State College, Raleigh, 
North Carolina, U. 8. A. 
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Mr. Donald C. Riley, American Statistical Association, 1757 K Street, N.W., 
Washington 6, D. C., U.S. A. 

Dr. Ralph Rossen, L. E. Phillips Psychobiological Research Division, Mt. Sinai 
Hospital, Minneapolis 4, Minnesota, U.S. A. 

Mr. Raymond E. Roth, Department of Mathematics, St. Bonaventure University, 
St. Bonaventure, New York, U.S. A. 

Mr. Jagdish S. Rustagi, College of Medicine, University of Cincinnati, Cincinnati 
19, Ohio, U.S. A. 

Mr. Darshan Lal Sachdeva, 308 South Macomb, Tallahassee, Florida, U. S. A. 

Mr. Henry E. Schaffer, Department of Genetics, N. C. State College, Raleigh, 
North Carolina, U.S. A. 

Mr. Marvin A. Schneiderman, Biometry Branch, National Cancer Institute, 
Bethesda 14, Maryland, U.S. A. 

Mr. Wilfred M. Schutz, Department of Genetics, N. C. State College, Raleigh, 
North Carolina, U.S. A. 

Mr. Edward Selig, 130 Beach Avenue, Mamaroneck, New York, U.S. A. 

Mr. John L. Seliskar, U. S. Forest Service, USDA, South Building, Washington 25, 
1D), Ol, We ish AN 

Dr. C. W. Sheppard, Department of Physiology, University of Tennessee, Memphis 
3, Tennessee, U.S. A. 

Mr. Robert E. Sherman, 4143 Blaisdell Avenue S., Minneapolis 9, Minnesota, U.S. A. 

Mr. Lawrence E. Sly, Jr., 202 West 9th Avenue, Tallahassee, Florida, U. S. A. 

Mr. Herbert Stern, Jr., 737 Carol Marie Drive, Baton Rouge 6, Louisiana, U.S. A. 

Dr. Claire M. Vernier, Department of Medicine and Surgery, VA Central Office, 
Washington, D. C.. U.S. A. 

Dr. Richard L. Willham, 3624 Ross Road, Ames, Iowa, U.S. A. 

Mr. Ralph P. Winter, 5712-38th Avenue South, Minneapolis 17, Minnesota, U.S. A. 

Mr. Donald F. Wilson, 6127 Westchester Drive, Washington 22, D. Gey, UES A 

Dr. Charles Wunder, Department of Physiology, State University of Iowa, Iowa 
City, Iowa, U.S. A. 


French 


M. Jean Louis Beaumont, 14 rue Petrarque, Paris 16e, France. 

M. Marcel Brunard, 17 avenue Emile-Deschanol, Paris Ze, France. 

M. Paul Damiani, Institut National de la Statistique, 29 quai Branly, Paris 7e, 
France. 

Mme. Jacqueline Roquet, Docteur en Pharmacie, 22 avenue Victoria, Paris 1°, 
France. 

M. Luu-Mau-Thanh, 19 Boulevard Brune, Paris (XIV°) France. 


German 


Dr. K. H. Barocka, Gartenstr. 6, Einbeck/Hann., Germany. 

Prof. Dr. B. Baule, Nibelungenstr. 63, Graz /Oesterreich, Austria. 

Dr. W. D. Froehlich, Rochusweg 12, Bonn, Germany. 

Dr. H. G. Kmoch, Inst. f. Pflanzenbau, Katzenburgweg 5, Bonn /Rhein, Germany. 
Dr. J. Krippl, Nordendstr. 14, Munchen 13, Germany. 

Dr. R. Krussmann, Anthropol. Institut, University, Mainz, Germany. 

Dr. Gunter Wricke, Post Nordhorn, Fa. v. Lochow-Petkus, Klausheide, Germany. 
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India 


Mr. Prem Narain, Animal Genetics Division, Indian Veterinary Research Institute, 
Izatnagar, U.P., India. 

Mr. G. Narasimharao, Asst. Physiologist, Sugarcane Research Station, Anakapelle, 
India. 

Mr. J. S. Ramaratnam, 293/7 Suifabad Lane, Khairabad, Hyderbad, India. 


Japan 


Mr. Masaki Horie, Haraikata-machi 9, Shinjyuku-ku, Tokyo, Japan. 

Mr. Kiyoo Kimura, Department of Mathematics, Mie Prefectural University, 
11 Ootani-cho, Tsu Mie, Japan. 

Mr. Kozuo Kitamura, Saitama Prefectural Agricultural Experiment Station, Ageo 
City, Saitama, Prefecture, Japan. 

Mr. Masahiko Sugimura, Kumamoto Women’s University, Ooemachi, Kumamoto 
City, Japan. 


Netherlands 


Dr. K. J. van Deen, Laboratorium voor Sociale Geneeskunde, Oostersingel 69 I, 
Groningen, Netherlands. 


WNAR 


Mr. Herbert B. Eisenberg, 1329 22nd Street, Santa Monica, California, U. 8. A. 

Dr. William R. Gaffey, 3119 Eton Avenue, Berkeley 5, California, U. 8. A. 

Mr. William B. Owen, Statistical Laboratory, Colorado State University, Fort 
Collins, Colorado, U.S. A. 

Mr. Patrick K. Tomlinson, Calif. State Fisheries, 511 Tuna Street, Terminal Island, 
San Pedro, California, U.S. A. 


NEWS AND ANNOUNCEMENTS 


Members are invited to transmit to their National or Regional Secretary (1f members 
at large, to the General Secretary) news of appointments, distinctions, or retirements, 
and announcements of professional interest. 


NEWS ABOUT MEMBERS 


Victor Chew has accepted a part-time position (starting September 1, 1961) in 
the Department of Biostatistics of the Johns Hopkins University. He will divide 
his time between Baltimore, Maryland and Dahlgren, Virginia, where he is a mathe- 
matical statistician in the Operations Research Branch of the U. 8. Naval Weapons 
Laboratory. 

John J. Gart will spend 1960-61 on a Postdoctoral Research Fellowship at 
Birkbeck College, University of London, while on leave from his position of Assistant 
Professor of Biostatistics at The Johns Hopkins University. 

David G. Gosslee, formerly with the University of Connecticut, has joined 
the Statistics Section of the Mathematics Panel at the Oak Ridge National Lab- 
oratory where he will consult with biologists. 

John Gurland, formerly Professor of the Department of Statistics, Iowa State 
University, recently accepted a position as Professor of the Mathematics Research 
Center, U. S. Army, at the University of Wisconsin. 

Vincent Hodgson has completed this graduate study at the London School of 
Economics and Political Science and joined the faculty of The Department of 
Statistics, The Florida State University, Tallahassee, Florida. 

Maurice G. Kendall, professor of statistics at the University of London and 
president of the Royal Statistical Society, has been appointed to the board of 
C-E-I-R (U.K.) Ltd. He will assume the new post of director for the London-based 
company’s mathematics, statistics and operations research departments, effective 
October 1, 1961 and will then vacate his chair at the university. 

Eugene Lukacs of the Catholic University of America will take a sabbatical 
leave during the academic year 1961/62. He will spend the greater part of this 
time at the Institut Statistique de l’Université de Paris working under an Air Force 
Grant. From April 1962 to July 1962, he will be Visiting Professor at the Swiss 
Federal Institute of Technology. 


INTERNATIONALES SEMINAR 
uber 
biometrische Methoden in der Medizin und 
Genetik 


veranstaltet von der Schweizerisch-ésterreichischen 
Gruppe der Internationalen Biometrischen Gesellschaft 


vom 18. bis 22. September 1961 in Wien. Osterreich. 


Der Zweck des Seminars besteht darin, den Teilnehmern eine grundlegende 
und systematische Ausbildung in der biometrischen Behandlung medizinisch- 
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therapeutischer Probleme (Grundprinzipien des Planens und Auswertens einsch- 
lagiger Versuche, spezielle Versuchsplane und ihre Begrtindung, sequentielle Me- 
thoden) und Fragen der Genetik (Genfrequenzschatzung, Evolutions- und Muta- 
tionsfragen, Vererbung quantitativer Merkmale) zu vermitteln. Beide Themen- 
kreise sind sowohl fiir die Mediziner als auch die Genetiker aktuell und interessant. 
Im Sinne einer wirksamen Instruktionsveranstaltung sind taglich héchstens vier 
Vorlesungen vorgesehen. An mathematischen Grundkenntnissen wird nur der 
tibliche Mittelschullehrstoff vorausgesetzt. Das Lehrprogramm wird von ausge- 
wahlten Spezialisten betreut. Da eine begrenzte Teilnehmerzahl vorgesehen 
ist, ersuchen wir schon jetzt um Vormerkungen beim Ortlichen Tagungs- 
sekretariat: Institut fiir Statistik an der Universitat Wien, Wien I., Rathausstrafe 
19/11/38. 

Der Versand des detaillierten Programmes und des endgiiltigen Anmeldungs- 
formulares erfolgt im Sommer 1961. Teilnehmerbeitrag: sFr 30.—(bzw. 6. 8 180.—, 
DM 30.—). 


Prof. Dr. A. Linder, Genf Prof. Dr. 8. Sagoroff, Wien 
Prof. Dr. H. L. Le Roy, Ziirich Prof. Dr. L. Schmetterer, Wien 
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BIOMETRIE—PRAXIMETRIE 


Tome I, N° 38-4 J uillet-Octobre, 1960 


Editorial 


Utilisation de calculateurs électroniques dans l’analyse d’expériences avec 
répétitions et de groupes d’expériences de méme plan expérimental .F. YATES 


Planification des expériences d’alimentation des vaches laitiéres...H. L. Lucas 
La sélection artificielle suivant des caractéres quantitatifs..... H. L. Le Roy 


La répartition des sexes chez des cucurbitacées hybrides et 1 leur relation 
avec une distribution contagieuse de Neyman................-. F. WEILING 


La puissance du critére F dans l’analyse de la variance de plans en blocs au 
hasard, nomogrammes pour le choix du nombre de répétitions. ...M. Kruus 


Développement des méthodes biométriques et statistiques dans la recherche 
agronomique au Congo Belge et au Ruanda-Urundi.......... J. M. Henry 


Développement des méthodes biométriques et statistiques dans la recherche 
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THE POISSON PASCAL DISTRIBUTION? 


S. K. Karri anp Jonn GurRLAND? 


Iowa State University of Science and Technology 
Ames, Iowa, U.S. A. 


1. INTRODUCTION 


Elementary distributions such as the Poisson, the Logarithmic 
and the Binomial which can be formulated on the basis of simple 
models have been found to be inadequate to describe the situations 
which occur in a number of phenomena. The Neyman Type A (cf. 
Evans [5]), the Negative Binomial (cf. Bliss and Fisher [3]), and the 
Poisson Binomial (cf. McGuire et al. [8]), which combine two of the 
elementary distributions through the processes of compounding and 
generalizing (cf. Gurland [7]), have been fitted with varying degrees of 
success to data from a number of biological populations. The aim of 
this paper is to study what may be called the Poisson Pascal distribu- 
tion which includes the Neyman Type A and Negative Binomial as 
particular limiting cases and serves as a natural complement of the 
Poisson Binomial. 


2. FORMULATION OF POISSON PASCAL DISTRIBUTION: 


Consider the well-known example of the egg masses and the larvae 
illustrating the process of generalizing (cf. Neyman [9]). Suppose that 
the egg masses in a field have a Poisson distribution with probability 
generating function (p.g.f) exp. {A(z — 1)} and that the survivors 
within an egg mass have a Pascal* distribution with p.g.f. (¢ — pz), 
p>0,¢=1+ p,k > 0. For the sake of references, we note that, 
if g(z) is the p.g.f. of a distribution, then the coefficient of z* in the 
Taylor’s expansion of g(z) yields the probability of count x. Then, 
by following the arguments in Gurland [7], we obtain for the distribu- 
tion of the survivors in the field, the Poisson Pascal distribution 
with p.g.f. 


1This research was sponsored in part by the Air Force Office of Scientific Research. 

2Current address: The Florida State University, Tallahassee, Florida. 

3On leave at Mathematics Research Center, U. S. Army, University of Wisconsin. 

4¥Feller [6] refers to the distribution with p.g.f. (¢ — pz)", p > 0,g =1+7,k =1,2,3°°"° etc. 
However, we will use the work Pascal as synonymous with Negative Binomial and use it in the place 
of the latter since it is shorter. 
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exp {Al(q — pe) — 1]}}, A> 9, k>0, p >.0)- G17. (1) 


It is believed that most of the deviation of the distribution of the 
survivors from the simple distributions mentioned above stems from (i) 
the complex structure of survivors within an egg mass and (ii) the 
movement of survivors from place to place. Since the Negative Bi- 
nomial distribution has been found to be very useful in fitting data 
involving this type of heterogeneity, it is reasonable to suppose that 
the Poisson Pascal will give a better description of the population of the 
survivors in a field. 

The Poisson Pascal distribution can also be looked upon as the 
result of compounding a Pascal distribution with p.g.f. (¢ — pz)” by 
taking k, to behave as kx where k is a positive constant and x a Poisson 
random variable with p.g.f. exp {A(z — 1)}. 


3. SOME PROPERTIES OF THE POISSON PASCAL DISTRIBUTION 


The limiting forms that the Poisson Pascal distribution takes as the 
parameters take on extreme values are given in Table 1. It is to be 


TABLE 1 


Some LimitiING Forms oF THE Poisson Pascal DISTRIBUTION 


No. Limits Taken Name and p.g.f. of the limit 
1 k—>o,p—-0 Neyman Type A, exp {Alexp (A(z — 1) — 1]} 
pk = Mt 
2 k—-0,A7> @ Negative Binomial, (¢ — pz)~: 
ON — ky 
3 p—70,+7> © Poisson, exp {A(z — 1)} 
kp = M 


noted that the Neyman Type A and the Pascal distributions are among 
the limiting forms. 

The flexibility of the Poisson Pascal was compared quantitatively 
with that of the Neyman Types A, B, C, Pascal, and Poisson Binomial 
by evaluating the relative skewness and kurtosis of each for fixed 
mean kyp, and variance kyp,(1 + p,) using the indices Ki3}/kip, and 
k1a/kip’ respectively along the lines of Anscombe [1]. The range of 
numerical values of these indices for each of the foregoing distributions 
is shown in Table 2. It is apparent that the Poisson Pascal covers the 
entire range of distributions from Neyman Type A to Pascal with 
respect to skewness and kurtosis. Also, the ranges of these ratios for 
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TABLE 2 


CoMPARISON OF SKEWNESS AND KURTOSIS OF CERTAIN DISTRIBUTIONS 


No. Name Range of Skewness Range of Kurtosis 
i Neyman Type A 1 1 
2 Neyman Type B 9/8 27/20 
3 Neyman Type C 6/5 8/5 
4 Pascal 2 6 
5 Poisson Pascal (1,2) (1,6) 
6 Poisson Binomial (0,1) (0,1) 


the Poisson Pascal and the Poisson Binomial are disjoint. Since their 
p.g.f.s have the common form 

exp {A[(q — pe)" — 1}, (2) 
we observe that the distribution with (2) for p.g.f. wherein \ > 0, 
g=1+7,p>O0whenk > Oand-l<p< 0 when k is a negative 
integer, covers a very wide range of distributions. As an aid in com- 
puting the values of the ratios for the samples to obtain an idea as to 
how close the sample distribution is to the various distributions men- 
tioned in Table 2, formulae to compute the factorial cummulants «;;} 
using sample moments are given in Appendix A. 

Since the first two frequencies were large in the sets of data to 
which this (and similar) distributions were fitted, the ratio of the first 
two frequencies were compared for some of these distributions. It 
can be easily shown that the value of this ratio for the Poisson Pascal 
distribution lies between the ratios for the Neyman Type A and the 
Pascal distributions. The ratios are given for brevity in Table 3. 


TABLE 3 


CoMPARISON OF THE RATIO OF THE First TWO FREQUENCIES WITH MEAN 
AND VARIANCE FIXED AS k,P) AND kiPi(1 + pi) 


No. Distribution Ratio of Frequencies 
1 Neyman Type A kyp: exp (— 71) 
-2 Neyman Type B 2h, {1 — exp (—3pi)(1 + 3p1)}/(pim) 
3 Neyman Type C 3k {2p: exp (—4p1) + 2[—2 exp (—4pi) 
+ 64p? + 4p:]} /4p} 
4 Pascal kip + pi) 
5 Poisson Pascal kip(l + pi)/(k + 1)-F* 


Bol eel 
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4, FITTING POISSON PASCAL AND EFFICIENCY OF 
METHODS OF ESTIMATION 


Since obtaining maximum likelihood estimates is very cumbersome, 
ad hoc methods were used to estimate the parameters. When the mean 
and the variance were large, use was made of the method of the first 
three moments. When they were moderate and the proportion of the 
zero frequency large, use was made of the method of the first two 
moments and the proportion of the zero frequency. When the first 
two frequencies were large in comparison with the remaining, the 
method of the first two moments and the ratio of the first two fre- 
quencies was used in estimation. The equations for estimation are 
given in Appencix C. The fit of this distribution to the data of Beall 
and Rescia [2] are given in Tables 4 and 5. For the sake of reference, 
the fit of a generalization of the Neyman Type A as given by the authors 
of these data are also given alongside. The relatively good fit of the 
Poisson Pascal is apparent. 


For obtaining a comparison of the various methods of estimation, 
it was decided to compute the efficiency function 


‘TABLE 4 


Fit oF THE OBSERVED FREQUENCY OF LESPEDBZA CapitTaTa, TABLE V oF [2] 


Expected frequency 
Observed due to Poisson Pascal Expected frequency 
Plants Frequency (Method of Moments) as In [2] 
0 7178 7185.0 7217.6 
1 286 276.0 218.6 
2 93 94.5 105.5 
3 40 41.5 50.9 
4 24 20.2 24.5 
5 Uf 10.4 11.8 
6 5 5.6 5.7 
7 1 om 2.8 
8 2 Ie 1.3 
9 1 1.0 6 
10 2 6 3 
ililse il 3 A 
x — 9.58 42.97 
Degrees of 
Freedom — 8 9 
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E = 1/(Generalized variance X Information determinant) 
(cf. Cramer [4], pp. 489-497). (3) 


If we denote the parameter vector (A, p, k) by (A; , Az , As) and the 
set of statistics used by (ft, , t2 , t3), we get the expression for the general- 
ized variance of the estimates as 


TABLE 5 


Fit oF OBSERVED FREQUENCY OF LEPTINOTARSA DECEMLINEATA, 
TABLE III oF [2] 


Expected frequency due to 
Poisson Pascal, (method of Expected 
two moments and first frequency 
Insects Observed frequency as in [2] 
0 33 33.0 39.5 
1 12 9.8 6.0 
2 5 7.4 4.9 
3 6 5.5 3.4 
+ 5 4.0 3.2 
5 0 2.9 2.5 
6 2 Pail 2.0 
if 2 15 1.6 
8 2 ih ih 1B 
9 0 8 AW) 
10 1 6 8 
1i+ 2 18? 3.3 
xe ae 6.88 13.75 
Degrees of 
Freedom — 8 9 


2 


(4) 


0 ols) 29 
G@ = | V(t taste) /| es 


where 7; , 72 , 7s are the functions of d, , Az and ); estimated consistently 
by t; , f,¢;. A proof of this is given in Appendix B. Since evaluating 
the covariance matrix V(t, , t, , ts) and the derivatives of t; , te and ts 
follows from the regular statistical techniques, no elaboration need be 
made here. A formula for the information determinant (cf. Shenton 
[10]) is 
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airolg diy ERs Se 

2s PD. Oh, Oa — Es ol Oe ene 

ag 1 OP, oP, 1 aP, aP, ‘1 OP, oP, 
Gs ye Poo pe Paco ye Puro (5) 

1 aP, aP, 1 aP, oP, > 1 OP, oP. 

1 Pais Gy Pato OM eee 


The principal problems in evaluating J for a value of (A, p, k) therefore 
are (i) to evaluate the various P, and the derivatives of P, and (iz) 
to determine the number of terms to be used in summing the infinite 
series. To obtain P, , let 


g(z) = exp {Al(q — pe)” — 1} (6) 
and 

h@ = (q — pa)". 
By differentiating (6) successively we get 


g’(2) = Ag@h’@), (7) 
and 
G2) = \ Ds & ‘ Ns @Aer-P@), (8) 


Set z = 0 in equations (6), (7) and (8) and observe that g’(0) = x! P. 
and h‘*” (0) = x! 7, where 


(he od) es ees 
Weg eae ? (9) 


is the probability of x in the Pascal distribution with h(z) as p.g.f. 
Then we have the recurrence formulae 


Py = exp. {r[(q)* — 1} (10) 
and 
Pra = aH (x Sn Be, M241 = iP, (11) 


which can be repeatedly used to evaluate any P, . 


The various derivative are given by 


SP oP sao" (6 tae 
coe ral a) |\ iS 112 Lae Ms yy 


eee 
= kp (x =f DP i pi a “|, (12) 
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oP, OR iL} “ily i ae lhe 
Bie ep. el lees her g() Sa rs ge) 
5 ‘p @! \o2" fOr oe EOS ED aE 
1 
=F (aP.-@+0)P.} (8) 
and 
Darga Be E 
Gi Ok al leet | ae No 
Los 
=F Vaat LT AI@A@ log (a — pely 
Ile ly ; : 
ees Ds (w/a) | (ae — 4+ DP. — P@ — OP.-s} 
1 a 
oe (log g){q(@ + 1)P.+1 — pxP;} (14) 
for all z. 
TABLE 6 
EFFICIENCY OF THE MEetTuop oF THE First THREE MoMENTS FOR 
THE PoIssoN PASCAL 
k 
x p 1 33 5 1.0 2.0 
mit a 84 .82 82 81 .76 
aah aS 59 .58 .58 54 AT 
ail ao 45 .44 43 40 508) 
nit 1.0 26 NS 24 22 .18 
Ell 2.0 13 12 12 12 Palle 
.5 al 90 81 82 77 .67 
aS 3 59 .58 56 49 .00 
“5 53) 46 44 41 34 a2 
BD) 1.0 26 2D 23 18 10 
ad) 2.0 13 pill 12 — — 
1.0 oil 81 83 82 75 63 
1.0 3) 59 .59 155) 46 .28 
le) BO 46 44 40 31 oss 
10 1.0 27 25 Me) ES .05 
1.0 —_— — — =S a — 
5.0 al g4* 99* .78* 58* sane 
5.0 3 62* 56* 41* ile nO 
520 a 48* 38* 20" 04* .00* 
5.0 1.0 29* er .06* 02* 00* 
5.0 — — oe aa = a= 
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To determine the number of terms we use the following rule: 
Let T,,;(n) denote the (n + 1)th term in the series involved in the 
(i, j)th term of matrix (5). Let S;;(n) ates T,;(r). Compute 


S;;(n) and Ee T?,(n)/Si;(n) for n = 1, 2, --- et cetera successively 
till a value of n is reached for which 
X Ti) /Si(a) < 10°. (15) 


It is clear from (15) that | 7';;(n) |/| S:;(n) | <10~* for each 7 and j. 
If the series converge faster than a geometric series with common 
ratio less than 0.9 and this convergence starts before the value of n is 
reached for which (15) is satisfied, the calculated efficiency will be 
correct to three significant figures. If the significant figures do not 
cancel out, this should yield the efficiencies computed therefrom, correct 
to three decimal places. When the inequality (15) was not satisfied 
for values of n < 20, the partial sum of the first twenty terms was 


taken as the value of the series since evaluating the terms of the series 


TABLE 7 


EFFICIENCY OF THE METHOD oF THE First Two MoMENTS AND THE FIRST 
FREQUENCY FOR THE PoIssonN PascaL DISTRIBUTION ATA = 0.1 


k 
r p 1 3 5 1 2.0 
oll sik 99 98 .98 98 98 
al 3 93 94. .95 94 93 
gil 5 90 90 Gil 91 89 
all 1.0 82 83 84 84 83 
oil 2.0 74 76 18 84 78 
5 dl 1.00 1.00 .98 96 95 
5 3 93 94 .93 92 87 
5 5 91 91 .90 89 82 
5 0 83 85 .85 82 73 
5 2.0 75 78 81 = == 
1.0 al 1.00 oo {9 96 94 
Ta a) 94. 96 94 93 87 
1.0 5 92 92 91 98 81 
1.0 1.0 84 85 .86 82 72 
1.0 2.0 88* 87* Bodice sells 83* 
5.0 ol = = 199" .89* 98* 
5.0 3 OTe 99° 1.00* 1.00* 98* 
5.0 am) 95* yl .98* 995 96* 
5.0 1.0 .90* . 96* Oe — = 
5.0 2.0 89" — — — — 
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for n larger than 20 is very time consuming. The efficiency when n 
was restricted to 20 is marked with an asterisk to indicate that they 
are less likely to be correct to three decimal places. When the efficiency 
so computed was larger than one (due to the inaccuracy in computing 
the information determinant), the corresponding cell in the efficiency 
table is left blank. 

The efficiency of the method of moments is given for certain values 
of (A, p, k) in Table 6. The efficiency of the method of the first two 
moments and the first frequency is given in Table 7 and that of the 
method of the first two moments and the ratio of the first two fre- 
quencies in Table 8 for the same values of (A, p, k). 

It is apparent that the method of the first two moments and the 
ratio of the first two frequencies has high efficiency and is superior to 
the other two methods when 2 is small (and consequently the first 
two counts account for a large proportion of the observed frequencies). 
Also the method of the first two moments and the first frequency is 
highly efficient when X is moderately large. When X, p or k approaches 


TABLE 8 


EFFICIENCY OF THE METHOD OF THE First Two MoMENTsS AND THE RATIO OF 
THE First TWO FREQUENCIES FOR THE PorIssoN PAscAL ATA = 0.1 


k 
r p 1 3 5 1.0 2.0 
1 i 1.00 1.00 1.00 1.00 1.00 
i 3 98 99 .99 99 98 
‘A 5 99 98 .98 97 94 
1 1.0 94 94 93 91 83 
ot 2.0 87 88 89 91 99 
5 a 1.00 99 1.00% 1.00 99 
5 a 98 1.00 99 98 96 
5 1.0 99 98 97 96 92 
5 2.0 94 94 93 89* 80* 
1.0 el 98 1.00 1.00 1.00 99 
1.0 eC 99 1.00 99 99 95 
1.0 5 1.00 99 98 96 91 
1.0 1.0 95 94 93 88 79 
1.0 2.0 88* 89* 91* au a 
5.0 A ae a 99% 95* g1* 
5.0 8 1.00 99* 98% 89* 76* 
5.0 5 1.00 96* 91* 80* 65% 
5.0 1.0 95* .88* 79% 84* 1.00* 
5.0 2.0 90* 98* = = — 
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infinity, it can be shown by calculus that the method of moments is 
much superior to the other two but since the efficiency of each of these 
methods tends to zero for such values, this has little significance. 


5. CONCLUSIONS 


On the basis of the properties discussed in Section 3 and the fitting 
in Section 4, we observe that the Poisson Pascal distribution acts as a 
bridge between the Neyman Type A and the Negative Binomial dis- 
tributions and may be used with advantage when the latter distribu- 
tions are inadequate to represent the population accurately. 

From the tables of efficiency, it is clear that in the region of tabula- 
tions, at least one of the ad hoc methods of estimation suggested above 
has high efficiency. It is believed that in practice, (A, p, k) will not 
be far beyond the region of tabulation and that one of these methods 
can be used without too much loss of information. Techniques for 
choosing one of the many ad hoc methods on the basis of the sample 
will be discussed in a future paper. 
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APPENDIX 


A. Formulae to compute factorial cumulants using moments about the origin: 


We first obtain formulae to compute the first four factorial moments 
wus, 2 = 1, --+ , 4 using moments about the origin y; and then obtain 
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formulae to evaluate the first four factorial cumulants «,,,7 = 1, --- ; 4 
using these factorial moments. 
As for the first objective we note that u;;;=H{x(@— 1):--(e—-7+1)}, 
i = 1,2,---. By expanding the product within the expectation sign 
and using the elementary properties of the expectation operator, we get 
Ba) = Mi, eile itr a Le ) nen MS = Sus a Di 5 
and 
Mya) = we — Ong + 1p — Grr - (16) 
As for the latter, we observe that if w(¢) and y(t) denote the factorial 
moment generating function and the factorial cumulant generating 
function, then y(t) = log u(t). On differentiating the equation 
successively with respect to t att = 0 and noting that «i; = vy (0), 
we have 
VSiti| ae deeay sD LS) =" (AN) wi ) 
Kpaq) ane O08) SMH 2) sie Quin ) 
and 
Ki4) = Pi4] 3ui2) — 4y\Misy 1 12QuiriHi21 =. bhi) : (17) 


B. To prove formula (4): 

Since 7; , 72 , Ts are functions of A; , and Ag , let us write them more 
explicitly as 71(A1 , A2, Na), Te(Ar » Az » As) and 73 (A1, Nz , As) respectively. 
If {1 , Xe, Ae are the estimates of \; , \2, As, then using the statistics 
t, , tg , tg which estimate the 7’s consistently, we have 


PetOwosos 4 = 15 2,3: 


t 


Hence, we have the asymptotic relations 


OT; OT: ee ne 
ee Seg eg fh oe ee 
ae ER (Ste, oe (ha = Any Ap — Aa ha — Da), 


which can be rewritten as 


he ay My ts =e 1 | 
d(14 T2 5 aa 
et ee oer 
Ay — Ya Ee EN TINS) Laer 
x = As bs ramets 


The generalized variance G of 4, , As, ds is then given by 
Ge | HQ, — rr, Az — hz ,As — Xz)’ = Ne — he) As — )s3) | 


O(71 , Ta » Ts) 
= | Vii, tb) /\% » Ao As) 


which is formula (4). 


2 
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C. Equations for estimation: 

We give below the equations for estimating the parameters for the 
various methods mentioned in Section 4. Their derivations are omitted 
for brevity. 


i. Equations for the estimation of parameters using the first three 
moments are: 


k = - - Reka 3 — yy (18) 
K3]Kt1j (R(2)) 

P = Ria/(Riy(k + 1), (19) 

A = Rirj/(kp). (20) 


i. Equations for the estimation of parameters using the first two 
moments and the proportion P, of zeros are: 


p log {1 a (2 -p) lox Pol ae (i ~p) log (1 + p), (21) 


Kiy] Koi 
a Le | 
PK) 


ii. Equation for the estimation of the parameter p using the first 
two moments and the ratio P,/P, of the first two frequencies is 


log (1 R aes 
(1 + p) = log tenis Pa) (24) 
Pp K(2] 


k and } are then estimated by using equations (22) and (23). 


’ A = Ki /(kp). (22) 


SOME RANK SUM MULTIPLE COMPARISONS TESTS 


Rosert G. D. STEEL 


Institute of Statistics, North Carolina State College 
Raleigh, North Carolina, U. S. A. 


1. SUMMARY 


Rank sum tests for multiple comparisons and suitable to the 
completely random design with equal sample sizes are discussed. 
Significance tables and some facts about the joint distribution of rank 
sums are given. An example illustrating test procedures and making 
use of the tables is presented. 


2. INTRODUCTION 


A number of nonparametric tests based on ranks have been pro- 
posed for the comparison of treatments in a completely random design. 
For example, we have the Wilcoxon-Mann-Whitney test (21-10), 
basically a two-sample test with a per-comparison error rate. 

Also, Kruskal and Wallis [9] have proposed a rank test which is an 
analogue of Snedecor’s F-test. This test provides evidence concerning 
the presence of real differences but is of limited use in locating them. 

Steel [16, 17] has presented rank tests for comparing treatments 
against control and for all pairwise comparisons. Both of these tests 
use experiment-wise error rates. 

Pfanzagl [13], as part of a more general theory, has discussed a 
two-step nonparametric decision process based on ranks, for testing 
the null hypothesis that & samples come from the same population and, 
if this is rejected, for deciding which one of the samples comes from a 
different population. No tables are given but it is suggested that they 
might be obtained by random sampling. It is also shown that the 
limiting distribution of the multivariate criterion is multinormal. 

The per-comparison error rate test is sometimes criticized, par- 
ticularly when all possible paired comparisons are made, because it 
will almost certainly lead to false declarations of significance when the 
experiment includes many treatments and if customary significance 
levels are used. It is also deemed inappropriate when the experiment 
is considered to be the conceptual unit. 

The experimentwise error rate test is sometimes criticized because it 
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requires such a large difference to be declared significant that it becomes 
difficult to detect any but the largest real differences when customary 
significance levels are used. Also, it may be that the individual com- 
parison is considered to be the conceptual unit. 

A brief discussion of these error rates is given by Steel [18] in re- 
sponse to Biometrics Query 163. 

Choice of a definition of error rate in the conduct of a particular 
experiment seems somewhat less crucial when it is realized that we can 
compute the significance level for a particular definition of error rate 
from knowledge of the chosen significance level for any other definition 
of error rate. ‘This is not generally a simple computation unless the 
comparisons are independent. In the case of p independent comparisons, 
if a’ and @ are the experimentwise and per-comparison error rates 
respectively, we have the relation: 1 — a’ = (1 — a)’. 

When comparisons are not independent, computation of comparable 
significance levels for different definitions of error rate depends upon 
the extent of the dependence and the nature of the multiple comparisons 
test. No individual is likely to perform such a computation for a 
single experiment. Thus a table needs to be prepared for comparing 
the customary significance levels for differently defined error rates. 
For tests based on an underlying normal distribution, this has been done 
fairly extensively by Harter [4, 6]. 

The experimenter may try to meet the usual criticisms of per- 
comparison and experimentwise error rates by choice of a non-standard 
significance level or an alternative test procedure. Presently, tables 
of significant values for such levels do not appear to be available for 
experimentwise error rates; in the case of alternative tests, several are 
available, including the Newman-Keuls [11, 7] procedure and Duncan’s 
[1,5] new multiple range test, which are sequential in nature. 

This paper is concerned with rank tests, in particular with tables for 
an all-pairs-of-treatments test with an experimentwise error rate, and 
the use of these tables for a fixed rank sum test, an analogue of Tukey’s 
w-procedure (for example, see Steel and Torrie [19]), and for two multiple 
rank sum tests, analogues of the Newman-Keuls procedure and of 
Dunean’s test. Table 2 is used in the first two cases, Table 3 in the 
last case. 


3. CONSTRUCTION OF TABLES 


The proposed tests call for rank sums and their conjugates computed 
for all pairwise comparisons of treatments. The minimum of each 
rank sum and conjugate is used, the set of minima providing a multi- 
variate rank sum test criterion. These sums are compared with a single 
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tabulated value for the fixed rank sum test and with several values for 
the multiple rank sum tests. Table 2 provides critical values for the 
analogues of Tukey’s and the Newman-Keuls tests; Table 3 provides 
for the analogue of Duncan’s test. 

Methods for constructing probability tables and limited tables have 
been presented earlier [16, 17]. Construction of exact tables of any 
extent was beyond the computing facilities available. However, some 
machine time was available and this was used for some sampling ex- 
periments. 

It was originally intended to ignore the discrete nature of the data 
and to use the Kolmogorov-Smirnov [8, 14] one-sample test to determine 
the sample size necessary to attain a certain precision in the constructed 
tables. However, available computing facilities limited sampling to 
values of k = 3 and 4 (2 and 3 treatments when one was control) and 
n = 4 (1) 10. In addition, samples were obtained for k = 5, n = 4, 
5 and k = 6,n = 5. The number of permutations obtained for each 
case was either 5000 or 6000. These tables were used only for checking 
purposes against the few exact distributions available, k = 3 and 

= 3 (1) 6, and against approximations used in constructing these 
and earlier tables. 

It was assumed that the various multivariate rank sum criteria 
are distributed approximately as multinormal distributions having 
mean vectors and variance-covariance matrices as given in the appendix. 
(Fraser’s [3] vector form of the Wald-Wolfowitz-Hoeffding theorem 
does not apply since the || C,.(¢, j) ||’s of Fraser do not exist for the 
test criteria used here.) 

On this assumption, one naturally proceeds to base computation 
on presently available tables. Tables for Tukey’s and Duncan’s tests 
are the obvious choice for all-pairs tests. These tests are based on a 
multinormal distribution with p” = nin;/(m, + n:)(m, + n;) where the 
present distribution calls for p? = nn;/(mx + ms + I) (rm + 1 + 1), 
a small difference. The appropriate tables are, then, tables of the 
Studentized range with known variance, that is, infinite degrees of 
freedom. Table 22 of Pearson and Hartley [12] is such a table, gives 
percentage points of .10, .05 and .01, is appropriate for the first two 
tests, and was used in computing Table 2. Corrected tables for Duncan’s 
test have been computed by Harter [5] and this table was used in 

computing Table 3, also for percentage points of .10, .05 and .Ol. 
Table 2 was constructed by taking the integral part of wp — to/ V2, 
unless the decimal fraction was > .9, in which case the next higher 
integer was tabulated, where ¢ was obtained from the distribution of 
w/o, w = range, Table 92 of Pearson and Hartley [12]. Since rank 
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sums are essentially differences, it is necessary to introduce 1/2 into 
the denominator as shown. Tabulated rank sum values for a = .10 
differed in only two cases from values obtained by sampling. In par- 
ticular, for k = 4, n = 4, no value is significant by sampling; for k = 3, 
n = 6, a rank sum of 25 is significant by sampling whereas 26 is not. 
Values for a = .05 differed in no case. Values for a = .01 ran lower 
than those obtained by sampling, the difference increasing with n to a 
value of two in three cases. Hence, it is reasonable to conclude that 
tabulated values of the rank sum are conservative (low) for a = .01. 

The first attempt to construct Table 3 led to values which tended to 
run high for a = .10, correct for a = .05, and low for a = .01, relative to 
values found by sampling. For this reason, tabulated values are of 
u — to/+/2 decreased by unity for «a = .10, as computed for a = .05, 
and increased by unity for a = .01. On this basis, tabulated values 
for a = .10 appear to be low, hence conservative, when not in agreement 
with sampling results; in particular, 7 out of 19 tabulated values are 


one unit low. For a = .05, 3 out of 19 values are one unit high. For 
a = .01, 5 out of 19 values are one unit low, with three of these being 
forn = 5. 


Tables have already been constructed [16], using Dunnett’s [2] 
tables, for the treatments against control test. These tables agree well 
with the sampling results. In no case is there a difference of more 
than one in the value of the test criterion, with the tables most often 
giving the conservative (lower) value. 


TABLE 1 


FinaL WeIGHTS OF CuIcKs AT Six WEEKS (GRAMS) FOR VARIOUS 
SOURCES OF PROTEIN SUPPLEMENT 


H L Sb Sf M 

Horse- Linseed Soybean Sunflower Meat C 

bean Oil Meal Oil Meal Seed Oil Meal Meal Casein 
179 309 243 423 325 368 
160 229 230 340 257 390 
136 181 248 392 303 379 
227 141 327 - e330 315 260 
217 260 329 341 380 404 
168 203 250 226 153 318 
108 148 193 320 263 B02 
124 169 Pa 295 242 359 
143 213 316 334 206 216 
140 257 267 322 344 222 
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4. USE OF TABLES 


To illustrate the use of the tables, the data in Query 60 of Biometrics 
(15) are used. These are presented in Table 1. Since the test is presently 
unavailable for unequal sample sizes, only the first ten items in each 
treatment are used. 


The following set of minimum rank sums is obtained by pairwise 
rankings, a minimum being the lesser of the rank sum 7’, and its conju- 
gate, 7’ = (2n + 1)n — T; minimum treatment is the treatment for 
which the rank sum is minimum. Ties were assigned their average 
rank. This gives a multivariate criterion with 15 entries. 


Comparison H, Sf H, Sb ANG L, Sf H, M lbp G 
Minimum Rank Sum 56 57 58 60 62 644 
Minimum Treatment H H H L H L 


Comparison Sb, Sf H,L L,Sb L,M 8b,C Sf,M M,C 8b,M Sf, C 
Minimum 

Rank Sum 71 75 765) 75 
Minimum 

Treatment Sb H L ile; Sb Sf M Sb Sf 


80 82 84 100 103 


i 


From Table 2, a rank sum of 67 is significant at the 5 percent level, 
k = 6, n = 10; hence six comparisons are declared significant. 

The device, used with multiple comparisons procedures, of under- 
lining treatments which cannot be distinguished by their means may 
be adapted to apply to rank sum procedures. Thus, from the test, 
it appears that H and L, and L, Sb, M, Sf and C form two groups 
as a first step; since L can be distinguished from Sf and C,, the latter 
group becomes two, namely L, Sb and M, and Sb, M, Sf and C. We 


have: 
H L Sb M Sf G 


Ordering of Sb and M, and of Sf and C was done on the basis of rank 
sums for these paired comparisons though this does not imply that 
this is the only, or even the best, method. 

This procedure is an analogue of Tukey’s w-procedure [19]. The 
significance level is for an experimentwise error rate. Use of Wilcoxon’s 
[21] two-sample test, with its per-comparison error rate, calls for a 
significant rank sum of 78; this will result in four more comparisons 


being declared significant. 
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TABLE 2 


Percentace Points or THE Minimum Rank SuM 
(AN APPROXIMATION ) 


Number k = number of treatments being tested 

in a SS 

treatment 3 4 5 6 ¥ 8 9 10 

10 10 10 = == — = a 74 

4 05 —_— — = = — = — == 

O1 —_— — = a= = ae aa! = 

10 17 16 NB} Ss — — == == 

5 05 16 15 — — = = — — 

01 — — — = = = —— = 

10 26 24 23 22 22 21 Dik — 

6 05 24 23 22 21 — — —_— — 

O1 -—— — — — = = = = 

10 36 34 oo 32 31 30 30 29 

uf 05 34 32 31 30 29 28 28 — 

01 29 28 — — — — — — 

10 48 46 44 43 42 41 40 40 

8 05 45 43 42 40 40 39 38 38 

on 40 38 37 36 — — — —_ 

10 62 59 57 56 55 54 53 52 

9 05 59 56 54 53 52 Sl 50 49 

01 52 50 48 47 46 45 —_ —_ 

sl@ He 74 We 70 69 68 67 66 

10 .05 74 71 68 67 66 64 63 63 

Ol 66 63 62 60 59 58 57 56 


It is also possible to propose and carry out a sequential procedure, 
an analogue of the Newman-Keuls procedure, which uses several rank 
sums for testing. For this procedure, the above analysis is the first 
step and has separated the treatments into three groups. To proceed, 
we assume that declared differences are indeed real. Hence to test 
H versus L, the first group, we may use Wilcoxon’s [21] two-sample test, 
H and L are declared significantly different and the line beneath them 
may be removed. 

Further, compare L, Sb and M using the critical value for k = 3, 
n = 10, namely 74. L versus Sb and L versus M at 75 and 753 are be- 
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yond the 10 percent point but are not quite significant. We cannot 
distinguish among these three treatments. 

Finally, consider the group composed of treatments Sb, M, Sf and C. 
The critical value is 71, k = 4,n = 10. The treatments Sb and Sf can 
be distinguished and we must, then, change the order of Sf and C from 
that proposed when the fixed rank sum test was used. No further 
differences will be declared significant by this procedure. We have: 


H L Sb M C Sf 


For the Tukey and Newman-Keuls parametric procedures, we find 
means of 160.2(H), 211.0(L), 267.4(Sb), 278.8(M), 323.2(Sf) and 
326.8(C). Also s; = 18.03 and significant ranges are 512,615, 67.6, 
71.9 and 75.2 for k = 2, --- , 6 respectively. 

For Tukey’s test, the fixed rank sum is 75.2. We find: 


H L Sb M Sf C 


For the Newman-Keuls procedure, we find: 
H L Sb M Sf C 


We now compare the results obtained from applying the parametric 
and non-parametric procedures. 

The fixed rank sum test and Tukey’s test lead to the same con- 
clusions. Both are based on experimentwise error rates. 

Conclusions drawn from the multiple rank sum test and the Newman- 
Keuls test differ as follows. L versus M is significant by the Newman- 
Keuls procedure only; Sb versus Sf is significant by the rank sum test 
only. Otherwise, the procedures lead to the same conclusions. Fifteen 
paired tests have been made. Since L versus M is nearly significant 
and Sb versus Sf is just significant by the multiple rank sum test, it 
would appear that the two methods lead to conlcusions, for this ex- 
ample, that differ only slightly. 

The other multiple rank sum test to be considered is an analogue of 
Duncan’s new multiple range test. Table 3 provides critical values. 
A rank sum of 75 is significant at the 5 percent level, k = 6, = 10; 
hence nine comparisons are significant. Tentatively, we have: 


H L M Sb C Sf 


Unfortunately, this includes an anomaly since Sb versus Sf is also de- 
clared significant. The same would be true if the Wilcoxon-Mann- 
Whitney test were being used at a significance level (between 5 percent 
and 1 percent) calling for a critical value of 75. 
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TABLE 3 


PERCENTAGE PoINTs oF THE Minimum RANK SuM FOR DUNCAN ANALOGUE 
(AN APPROXIMATION ) 


Number k = number of treatments being tested 
in a SS SS 

treatment 3 4 5 6 7 8 9 10 
10 11 10 10 10 10 10 10 10 
4 05 10 10 10 10 10 10 10 10 
01 — — oe — = == = = 

10 18 18 Lud a iy is 17 
5 05 iG 17 16 16 16 16 16 

01 15 15 15 — — = 
10 ih 26 26 26 26 25 25 25 
6 05 26 25 25 25 Dis; 24 24 24 
O01 23 22 22 22 22, PA 21 21 
10 37 37 Si 36 36 36 36 36 
u .05 36 3D 35 35 34 34 34 34 
O01 32 32 ol 31 30 30 30 30 
10 50 49 49 49 48 48 48 48 
8 05 48 47 47 46 46 46 46 45 
01 43 42 42 Al 4l 41 41 40 
10 65 64 63 63 62 62 62 62 
9 05 62 61 60 60 60 59 59 59 
01 56 55 54 54 53 53 53 52 
oll) 81 80 79 79 78 78 78 Oil 
10 .05 77 76 76 18 75 74 74 74 
01 70 69 68 68 67 67 67 66 


On the basis that treatments declared significantly different are 
indeed so, we proceed to test treatments /, Sb, C and Sf using k = 4 
and LZ and M using k = 2. The final result is: 


H L Sb M C Sf 


This result is the same as that obtained using the Wilcoxon-Mann- 
Whitney test. 


Using Duncan’s (parametric) new multiple range test, we find: 


H L Sb M Sf C 
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Conclusions from Duncan’s test and its rank sum analogue differ 
only in that the parametric test finds Sb versus C to be significant. 

The multiple rank sum test can be adapted to apply to testing 
treatments versus control as well. 


5. APPENDIX—THEORY 


The problem is concerned with pairwise testing of treatments in the 
one-way classification or completely random design. 

The test criteria are rank sums, computed as for Wilcoxon’s [21] 
two sample test, for appropriate pairs of treatments. Rank sums will 
be referred to Tables 2 or 3 for testing rather than to White’s [20] table 
for the Wilcoxon-Mann-Whitney test. 

Two tests will be considered: 

1. The all pairs test, in detail. 

2. Treatments against control, rather briefly. 

For the all pairs test, let X; ,7 = 1, --- , & be random variables 
measuring some characteristic for each of k samples or treatments. 
Let there be n; observations on the 7-th treatment. Computation of 
the test criterion requires us to: 

1. Rank the X,’s and the X,’s, all 7 < j, assigning rank 1 to the 

least observation. 

2. Add ranks for the variable with fewer observations to give T';; . 

(There is no loss of generality if we assume 7, <n Sos eat 

3. Compute the conjugate of 7; , namely Te = (+04 teal cae 

The conjugate is the rank total that would be obtained if rank 1 
were assigned to the highest observation. Conjugates are required for 
two-tailed tests. 


Considers las yee es ae a T,.-1,x), @ criterion with (#) 


components. Rank tests are based on the assumption that, under H, , 
all permutations of the >- n; observations are equally likely. Hence, we 
must know the number of ways in which (71; , -*° ; T,-1.3) can be 
obtained. For this, a recursion formula is given in [14]. This provides a 
method, though tedious, of deriving the distribution of (12, °°° , Tx-1.x) 

The distribution of (T12, °°: » T's-1,.x) has been shown to have the 


following parameters [14]: 
BQ) = pe = ulm, +2; + 1/2, 
He ay = Gay = tans 1) / 12, 
EE = fie) Une Ont = nnmn;/12 = oni,ii 


EQ: =; bn (Ti; = Mii) = Ue = —nnmn,;/12, 
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ET, a Mor ML ii = Mii) =) Oghi aa | on 0} 
(eke = Pine = ere = nmn,/(m, +1; = 1)(n, Sti acta 
Doh, ii = 0. 


The determinant of the variance-covariance matrix is: 
k 
Meese 


The elements in the inverse of the variance-covariance matrix are: 
Corresponding to the variance of 7’; , 


12 Pie id Ne a n)/nn( 2, Ma tL). 


Corresponding to a covariance with the T’s having a common 
subscript, h, in the same position: 


SOS Ne +1). 


Corresponding to a covariance with the 7’s having a common sub- 

script, 7, in different positions, for example 7; and T;; : 
12. ona 

Finally, the element corresponding to o,,,;; iS zero. 

The determinant of the variance-covariance matrix may be evaluated 
as follows: from the ith row of the determinant, factor mm,;/12, 
i = 1,---,k — 1; from the ({&k — 1] + 2)-th row, factor nyn2,,/12, 
i=1,---,k — 2; --- ; from the last row, factor m,_1,/12. The 


k 
product of these factors is [] !~*/ 19%) 


The entries in the determinant which is the other factor may be 
described somewhat crudely as follows. 

The 7-th diagonal block, 7 = 1, --- , k — 1, which contains the 
variances and covariances of the 7’;,;’s, fixed 7 and 7 > 2, will be 


nN; ae Ni41 ai 1 N42 eae Ny 
N41 La (ae OPE ns walla Wea Ny 
Nid Ni+2 Deo Ota (De ae 


This is a (k — 7) X (k — 12) block. 
The block consisting of the same rows and the first (k — 1) columns is 


0 a as 0 —nN, Ny, 0 0 ip ge 0 
Os 20 mar MU) ae Ue secre 
08-8100 ne BONO Oar 

SSS ee 


4 — 2 columns k — <4 columns 
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The next block to the right consists of k — 2 columns and is: 


0 eae Us Nz Nos 0 0 it PS 0 

IOs iS see =e Qeens 10, arte 0 

Oo S2e(0 Re 0.20 eek 1a 
SS SS Wa ye 
2 — 3 columns k — 7 columns 


The pattern is now clear. . 
The first block to the right of the 7-th diagonal block consists of the 
next (k — [¢ + 1]) columns and is 


ae Sn eh, 
Neat Onaece C0 
0 ey ae et 
eal Pane cam OF 
The block to the right of this is 
0 Ce) cae mere. 
—Nisg —Niss —N 
Neve 0 0 
0 DKS 0 
0 On wore =n 


Again, the pattern is clear. 

At the next step in evaluating the determinant, from column 1 sub- 
tract columns ([k—1]+1) through ({k—1]+[k—2]). To column 2, add 
column ([k — 1] + 1) and subtract columns ([k — 1] + [k — 2] + 1) 
through ([k — 1] + [k — 2] + [k — 3]). In general, to column 
i,4 = 1,--: , k — 1, add the 7 — 1 columns described as those con- 
taining the first column of the diagonal blocks previously set out, and 
subtract the sum of the next (k — [2 + 1]) columns, that is, the columns 
including the (2 + 1)-st diagonal block. 

The result of the above operations is that the first diagonal block 
has >> n; + 1 as common diagonal element and zeros elsewhere. The 
- other blocks containing the first k — 1 columns are of the form previously 
described but with —n, and n, replaced by — (>> n, + 1) and Yn +1 
respectively. 

At the next and final step, leave rows 1, --- , & — 1 unaltered. 


550 BIOMETRICS, DECEMBER 1961 


Consider the next k — 2 rows which we now call the first set, the next 
k — 3 rows called the second set, and so on, the i-th set consisting of 
(k — [i + 1]) rows, beginning with row ee =") de ome 
j-the row of the 7-th set, add row 7 and subtract row 2 + j, these rows 
coming from the first k — 1 rows of the determinant. 

The result of these operations is a determinant with ys nm, + 1in 
the first k — 1 principal diagonal positions, ones elsewhere in the princi- 
pal diagonal, and zeros below the principal diagonal. Hence the de- 
terminant is as given. 

That the inverse elements are correctly given may be checked by 
multiplying the matrix by the given inverse. 

For the treatments versus control procedure, let X;,7 = 0,1,--- ,k 
be random variables measuring some characteristic of a control and 
k treatments with n; observations in the 7-th sample. 

Computation of the test criterion requires us to: 

1. Rank jointly the X,’s and X,’s, 7 fixed, giving rank 1 to the 

least observation. 

2. Add ranks for the variable with fewer observations, here assumed 

to be the check, to give T; . 

3. Compute the conjugate, 7% . 

Consider (7, , -:: , T,). Again, a recursion formula for finding the 
number of permutations which give rise to a specific value of 
(T,, °°: , T;) is given in [19]. 

The distribution of (7, , --- , 7.) has been shown to have the follow- 
ing parameters [13]: 


EC ,) = uw; = Mr + 2 + 1/2; 
ET; — Mi)” == NoNi(No +n; + 1)/12, 
ED; — w)(T; — wi) = o4; = nonn,/12, 
pig = NN;[ (mo +2; + YD(m +7; + 1). 


It may also be shown that the determinant of the variance-co- 
variance matrix is 


II :(no[Mo + md n: + 1/12. 


The diagonal and off-diagonal elements of the inverse are, respectively, 


DO) Ee 4 pn > Ne + 1) | 


ard 


-12/| min +9(Sm+ i) |. 


I 


and 
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To evaluate the determinant of the varlance-covariance matrix, 
factor non,/12 from the 7-th row of the determinant. 
This gives: 


Toe Rea eS 1 Nz sis Ny 
ae apa ny NM +m +1 --- Ny 
a Ns eee yet Wily teak 
Next, subtract the k-th row from the 7-th row, 2 =~1, ,k —1. 
We obtain 
ae: II n, To 1 0 wy —(No a 1) 
: 73 0 Mm +1: ee 1) 
eer ie tae me athe il 


Finally, obtain a new k-th column as the sum of all columns. It is 
then apparent that the given determinant is correct. 

That the given inverse elements are correct is seen by multiplying 
the matrix by the stated inverse. 

From the above information, it is possible to tabulate probabilities 
for the parent distribution and derived distributions of interest. 
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THE ESTIMATION OF REPEATABILITY AND 
HERITABILITY FROM 
RECORDS SUBJECT TO CULLING 


R. N. Curnow! 


Agricultural Research Council Unit of Statistics, 
University of Aberdeen, Aberdeen, Scotland? 


1. INTRODUCTION 


In any animal breeding selection programme, estimates of repeat- 
ability and heritability are needed to choose between the various 
selection schemes available and also to ensure that the highest possible 
genetic gains are obtained from the chosen scheme. Estimates of 
repeatability and heritability are generally subject to large sampling 
errors. Therefore, the most efficient methods of estimation should 
be used even if they do involve rather lengthy computations. In this 
paper, the maximum likelihood estimation of repeatability and herit- 
ability from records subject to culling will be considered. The more 
usual regression estimators are often very inefficient compared with 
these maximum likelihood estimators. 

Suppose that we wish to estimate the repeatability of lactation yield 
in a herd of dairy cattle. We shall assume that only first and second 
lactation yields are available and that, if there had been no culling 
(i.e., if all the cows had had second records as well as first records), 
the first and second records would have been normally distributed over 
the herd with means p, and py , variances oj and o; and covariance 
between the two records of the same cow o1.. The first and second 
records of cow 7 will be written y;, and y,. respectively. The assumption 
of normality for the distributions will probably be a reasonably good 
approximation unless the herd can be split into groups so that any two 
cows in the same group are much more alike than two cows in different 
groups. These groups may, for example, be groups of daughters of the 
same sire or groups of cows according to the year in which they gave 
their first record or the month in which they calved. Methods are 
available for making allowances for such groupings, but they will be 
assumed absent in the rest of this paper. Very rarely will the culling 


1The work on this paper was undertaken while the author was visiting the United States on a 


Harkness Fellowship of the Commonwealth Fund. 
2Present address: Unit of Biometry, University of Reading. 
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in the herd be sufficiently intense or sufficiently highly correlated with 
future milk yielding capacity to affect seriously the normality of the 
distributions. 

Repeatability is defined as the correlation between two different 
records of the same cow and is therefore 


Pp = 012/0102 « 


Since we are considering only first and second records, the question 
of whether the repeatability is the same for all pairs of records will not 
be discussed. The assumption is frequently made that oj = o} and, 
therefore, that repeatability is the same quantity as the regression 
coefficient of second records on first, 82: = o12/c0; . The assumption 
is made, for example, in the formula given by Lush [1945] for comparing 
cows with a differing number of records and in the formula given by 
Lerner [1958] for the ratio of the heritability of the mean of n records 
to the heritability of a single record. It was also used by Henderson, 
Kempthorne, Searle and von Krosigk [1959] in their discussion of the 
disentanglement of environmental and genetic trends from records 
subject to culling. When of # o; , an estimate of 8, is needed for 
prediction purposes but for other purposes estimates of o7 and o; may 
also be required. We shall assume in this paper that oj = o; = o°, 
say, and therefore that @., = p. A logarithmic transformation applied 
to the data may sometimes be useful in satisfying this assumption. 

p is generally estimated by b, the sample regression coefficient of 
second records on first. There are two reasons for this. First, b is 
very simple to calculate and, second, it is an unbiassed estimator of p 
despite any culling that may have been based on the first records. 
However, when o; = o; and the first records of all cows are available, 
whether or not they have second records, b is not the maximum likeli- 
hood estimator of p. The variance of the second records about their 
regression on the first estimates o7(1 — p’) and the variance of all the 
first records estimates o°. These two estimators can be combined to 
give an estimator of p. Maximum likelihood makes use of this in- 
formation as well as the information given by b. 

We shall derive the maximum likelihood estimators of p and o’. 
We shall show that the efficiency of b as an estimator of p, relative to 
the maximum likelihood estimator, is fairly low for values of the various 
parameters that may well occur in practice. The bias of the maximum 
likehhood estimator is shown to be small. This suggests that the 
maximum likelihood estimator may be worth calculating. The com- 
putations involve only the solution of a cubic equation. A section is 
devoted to an approximate check of the assumption that Cao 


REPEATABILITY AND HERITABILITY 555 


Attention has been confined so far to the estimation of repeat- 
ability. The methods to be discussed could also be applied to the 
estimation of heritability from parent-offspring records. The assump- 
tion of = o3 means that, had there been no selection of parents, the 
variance of the parents and of the offspring would have been equal. 
In heritability studies, the maximum likelihood method makes use of 
information about animals that are not parents of animals which also 
have records. In milk yield studies, this would include information on 
dams that have only male calves. 


2. THE MAXIMUM LIKELIHOOD ESTIMATION OF REPEATABILITY 


All the cows have first lactation records but they do not all have 
second lactation records. We shall assume that the probability that a 
cow has a second record depends on its first record but not on any other 
character correlated with the second record. This rules out culling 
based on information about relatives or on characters such as per- 
centage butter-fat. However, the effect of such culling on the estimates 
will often be very small and could probably be safely ignored. 

Let N cows have a first record and n < N cows have a second record. 
Numbering the cows with a second record 7 = 1, 2, --- , n and the 
cows without a second record? = n + 1,n + 2, --- , N, the first records 
can be written 


Yair (Peale 2 NV) 
and the second records 
Yi2 @ = erp i ,m). 


The WN first records are normally distributed with mean py, and variance 
o. Because the culling is based only on first records, the distribution 
of a second record y,2 , given the first y,;, , is independent of the dis- 
tribution of the first record and is normal with mean p. + p(yi1 — mm) 
and variance o (1 — p’). The likelihood of all the records is therefore 


= 2 
DS (Yar i) 
a=1 


Puen Oe 2 27 
1 De [Yis a Ob a ae p(Yar a iy) ] 
ARC roa ae Zee) 


and the log likelihood is, apart from a constant, 
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nea — ae”) 


Ino = 5 ln(l =p) 


N n 
(Qaeee p) ye CESS fis) a ye (hie hae pYa — ws) ]° 
= 7) 
Kempthorne and von Krosigk (Henderson, Kempthorne, Searle and 
von Krosigk [1959]) suggested the use of maximum likelihood to 
estimate repeatability from records subject to culling. They derived 
the log likelihood when the cows were classified into groups and when 
the number of records for each cow was perfectly general. They derived 
very lengthy equations from which the maximum likelihood estimates 
of the parameters could be determined. In this paper, we are studying 
in greater detail the special case when there is no grouping of the cows 
and when there are at most only two records per cow. 
(2.1) can be written 


(2.1) 


~ (n + N) 2 n 2 Ns° 
InLb= — 5 Ineo NP) a 
_ nlp Si + 22 — 2p2Zrs) 
20(1 — p 
fe N@ = th) = Go = Ba pO Ha] 
Qo 2o0°(1 — p) 
where 
N n 
; dX (Yar gy) ; a (Yin aa 9) 
= a SE ; == = ; 
' 2d (Ue Jo) oS a Gi) Yie — G2) 
wy? = sy = i=1 
Coli n ) “12 a 5 
N n n 
eB Ya De Yar XE Yi2 
21 Side eee Ss eee Z 
Yy N i} Yi and Y2 =— 1 


The maximum likelihood estimators of », and mu, are clearly given by 


fi = 9 and pf, = Jo — AY: — sm). 


Also 
a(n L) x MASE i Ns" 4 np? Si PE) 
d(o°) Dan 2Qo0° 2e,(1 = p) 


. (2.3) 
Y= hh i nGo pee es ptr ro Mi) } 
ae 2o4 ae 2e°(1 rs es 
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and 
d(1n L) ye) ae nip(2; + 2s) — (1 + p ) Zl 
dp l—p (1 — p) 
MG. — m)[G2 — bo — pr — b)] 
i: oe (2.4) 
ae NplG2 — be — PGs = ia) ] 
ge (1 = 9) i 


By setting (2.3) and (2.4) equal to zero and substituting the maxi- 
mum likelihood estimators of », and ps , f satisfies the cubic 


(Ns — n>)p* — (N — n) 22h" 

+ [(m + N)Zi — NG’ — 23)]6 — M+ N)Zyp = 0. 
This equation will have one and only one root between —1 and +1 
of the same sign as the simple estimator b = %,,/2.. There may 


be two other roots between —1 and +1 of opposite sign to b. The 
formula for é is 


(2.5) 


ee OZ a tp) a 


v A WC) 2.6 
a = #9 Me 
When there is no culling, s> = =? and 
A ee 
* t+ 


When oj = co; , the maximum likelihood estimator of the correlation 
coefficient has the arithmetic rather than the geometric mean of the 
two sample variances in the denominator. 

To obtain the asymptotic variance of f, we need the expected values 
of the second-order partial derivatives of In L with respect to m, , Me , p 
and o*. These can be derived from a knowledge of the expected values 
of s’, 23 and 3,,. 2%? depends on the method of culling and so will 
be taken as fixed. For reasons to be given later, we shall calculate the 
expected values when of # o3. Clearly, E(s”) = (N — 1/N)o?.. By 
writing 


Yie = Me Je a SMe ON boa aor , 
1 


so that e;., has a standard normal distribution independent of yi , 


n 


ae ee eke 


1 
UDF 


325- = 
head FO 


and, therefore, for given 2j , 
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c oat —]1 
gat) = 22 3? + S—Y gra — 9. 


6 n 

Similarly, 
E(y) = “2 x. 
01 

Also, with o? = 02, EG — m)’ = 0° /N, Els — He — eG — )) = 
o(1 — p)/nand E{({, — m)[G2 — be — eG. — t)|} = 0. After con- 
siderable algebra the asymptotic variance—covariance matrix for fi , 
2 , 6 and @ is found to be V, where 


[N Hs ieee ope 0 _ nei = tn) 
eo ele) el) a aes 
ae ee te 0 mh — ey 
st eee aoe Gea CD 
n+N i: Meee 
0 0 Ge o (1 a p ) 
| eG, — mn) RG a) 8 
es) et eae) 
and 
ne n tl med pe nie) 257] aie mG. = bs)”. 
eigen ok Pes be o(1 — p) 


The asymptotic variance of / is 
HOE (N + m)( = 9%)? | 
m1 nN +n) = p°)21/0° + 2Np"] 


Writing n/N = S,so that (1 — S) is the culling intensity, and 
x?/c” = 1/c, where c measures the effect of the culling on the variance 
of the first records and is therefore a possible measure of the efficiency 
of the culling, 


ya (= S)\ io eee 
mi (1+ 8) — ep) 2 
(SSeS as] 


We shall now derive the approximate bias in # as an estimator of p. 
By substituting 6 = p + 64 in (2.5) and ignoring terms in 6 and 6, 


rel (Ns*—n2i)p° —(N—n) 2p +[(n+N)2i—N(s’— 22) ]o—M+N) 22 
3(Ns’ —n2,")p’ —2(N —Nn) Diop + [(n + N) 21° —N(s°— 2; J] 
(2.8) 


V@é (2.7) 
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Approximating (5) by taking expected values separately in the numer- 
ator and denominator, 


oe p(l — p*)o%(n — N) | 
Zin(n + N)1 — p°) +o [m — N) + p'(QnN + N — 8n)] 
To order 1/n, 


E(8) = 


a ee 
oe 3 24" | 


Cc 


(2.9) 


Therefore, from (2.7), the approximate ratio of the bias to the standard 
deviation is 


E(6)— pl — S) 


< lS Mes | ety tv. 


If n is reasonably large, the bias in # is unlikely to be serious. The bias 
will become relatively more important when estimates of p from different 
sources are pooled. 


3. ESTIMATION OF o? AND o2 WHEN o? # @? 
In the previous section we derived the expected values of s’, 2} 
and 2,, for given 2; when oj ¥ o;. They were 
N =~ il 2 
Nios? 


EG). = 


2 2 
ney = 23 p+ 2—D ata — 0, 


and 


po2 32 


E(243) = Te Fi . 


Apart from the usual slight differences in the multipliers, important 
only for small n and N, the following estimators of o; , 8 = po2/o, and 
o2(1 — p’) are the maximum likelihood estimators when o; # Pe 


é: = Ns’/(N — 1), 


ist =b= Dia) 24 , 
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and 


[Ze am Die/ Zi]. 


From the expected values of s*, 23 and 2, given in the previous 
section these estimators are all unbiassed. The maximum likelihood 
estimator of 03/0; is 


a : - 
2 2 = Z 

(@) = 2 4 ae 3.1) 
O71 “a1 Ss 


No exact method is available for constructing a confidence interval for 

02/02 except when n = N, i.e., when there is no culling (see Curnow 

[1957] for references and for a method to be used when the data are 
ga 


erouped). The asymptotic variance of (3/07) could be derived and 
used to provide an approximate confidence interval. This would give 
some indication of the importance of the assumption that o; = a . 
A simpler, but much less sensitive, test of whether o) > a, could be 
based on the fact that the quantity 

(N — 1)no; Ye — Zie/ Zi 


N(n — 2)o3(1 — p) - s° . 


which compares a x -value from the variation of the second records 
about their regression on the first with a x*-value from the variation 
of all the first records, has an F-distribution with n — 2 and N — 1 
degrees of freedom. 


(N — 1)n 3 — 2ie/2i 
N(n — 2) s 


significantly greater than F = 1 would suggest that o3(1 — p’) > a7 
and, therefore, that o5 > oj. 


Fs 


4. THE EFFICIENCY OF THE SIMPLE REGRESSION ESTIMATOR 
OF REPEATABILITY 


The statistic b = 2/2} is very easy to calculate and is the one 
generally used to estimate repeatability. It is the only possible esti- 
mator if first records are available only for those cows also having second 
records. Providing that culling is based solely on the first lactation 
yields, b is always an unbiassed estimator of 62, = po2/o, , but an un- 
biassed estimator of p only when oj = o;. In this paper we are assuming 
go, = o,. The variance of 6 is 


VG) 56 nS sey (4.1) 
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From (2.7), the asymptotic efficiency of 6 relative to the maximum 
likelihood estimator, 4, is therefore 


Eff. = CisE TG eye nf 
a Sp Ren 

The values of this efficiency are shown in Table 1 for various values 
of S, p and c. 

The following considerations suggest that ¢ is unlikely to be greater 
than c = 2. Let the N cows be reduced to LN by accidental factors 
uncorrelated with the level of yield. Then a proportion P=n/LN = S/L 
can be selected on the basis of first lactation yield. Assume that the 
selection is of the proportion S/L of the herd having the highest yields 
and that n and LN are sufficiently large that the effect of the selection 


TABLE 1 


Tue Asymptotic ErricIENCY OF THE SIMPLE ReGreEssion ESTIMATOR OF 
REPEATABILITY RELATIVE TO THE Maximum LIKELIHOOD ESTIMATOR 


Overall Selec- OQ = Gag 
tion Intensity Repeatability 
(S = n/N) (p) 1/2 1 4/3 2 4 
1/4 0 1.00 1.00 1.00 1.00 1.00 
1/4 0.95 0.90 0.88 0.82 0.70 
1/2 0.79 0.65 0.58 0.48 0.32 
3/4 0.49 0.33 0.27 0.20 Os 
1 0 0 0 0 0 
1/2 0 1.00 1.00 1.00 1.00 1.00 
1/4 0.96 0.92 0.89 0.85 0.74 
1/2 0.82 0.69 0.63 0.53 0.36 
3/4 0.54 OU. BY 0.30 0.23 0.13 
1 0 0 0 0 0 
3/4 0 1.00 1.00 1.00 1.00 1.00 
1/4 0.96 0.93 0.91 0.87 ORE 
1/2 0.84 0.72 0.66 0.57 0.40 
3/4 0.58 0.40 0.34 0.25 0.15 
1 0 0 0 0 0 
1 0 1.00 1.00 1.00 1.00 1.00 
1/4 0.97 0.94 0.92 0.88 0.79 
1/2 0.86 0.75 0.69 0.60 0.43 
3/4 0.61 0.44 0.37 0.28 0.16 
1 0) 0 0 0 0 
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can be approximated by the truncation of the top P proportion of an 
infinite normal population with mean py and variance o. The variance 
of the selected first records will then be 


Y= of1 —r»v —T)], 


where vy = Z/P and Z and T are the ordinate and abscissa at the point 
above which lies a proportion P of the population (Finney [1957] derives 
this formula and provides a table of values of y’ = p(v — T) for various 
values of P). The division of the overall selection intensity, S, between 
L and P is never complete and is certainly never known exactly. How- 
ever, P is generally considerably larger than S, 7.¢., there 1s a large 
amount of random culling and not so much actual selection. The 
value of ¢ is 


For various P, it takes the following values: 
P-= 0:05 400. L087 0.70.6) oe, 
124° [AO 72" 22.03 42.87 Oe oso 


C 


In practice, P is unlikely to be less than P = 0.7 and therefore c is 
unlikely to exceed c = 2. Table 1 does include ¢ = 4 to illustrate the 
effect of intense selection based on yield and ¢ = 3 to illustrate the 
effect of a selection scheme that results in an increased rather than a 
decreased variance. The efficiency is the same for p as for —p and so p 
is shown as positive in the table. In practice, p is very unlikely to be 
negative. 

When p = 0, the efficiency of the simple regression estimator is7 Ll. 
When p = 1, it is 0. For intermediate values of p, the efficiency in- 
creases rather slowly with S for fixed c, but decreases fairly rapidly 
with increasing c. For all values of c and S, the efficiency is very low for 
high values of p. As an example of the efficiency likely to occur in the 
estimation of the repeatability of lactation records, the efficiency is 
0.66 when c = 3, p = }.and S = #. In this case, the maximum likeli- 
hood estimator would certainly be worth calculating. 


5. THE ESTIMATION OF REPEATABILITY BY AN ANALYSIS 
OF VARIANCE 


Sometimes the effect of culling based on the first records is ignored 
and p estimated from a least squares analysis of variance of the N + 
first and second records. Table 2 shows this analysis of variance. 
The “cows and periods” sum of squares has been split into both “cows 
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adjusting for periods and periods ignoring cows” and “cows ignoring 
periods and periods adjusting for cows’. By a period difference is 
meant an average difference between first and second lactation records. 
The expected values of four important mean squares are shown. A, , 
A, and A; are defined at the foot of the table and vanish only if the 
culling is random with respect to the first records. Two cases need to be 
distinguished. In the first we assume », = uw. and in the second we do 
not. We shall use the reference numbers at the right of the table. If 
M1 = M2 , Mean square 2 and a mean square obtained by pooling sums 
of squares 3 and 4 are used to estimate o” and p. This is exactly equiva- 
lent to an analysis of a one way classification with unequal sub-class 
numbers (Snedecor, [1956], $10.16). If u, 4 uw. , mean squares 1 and 4 
are used to estimate o and p. There seems to be little justification 
for using mean square 2 with mean square 4. One or other is biassed 
or inefficient according as p, # pe OF wy = Mo. 

The bias in any of the above methods is difficult to determine without 
some knowledge of the effect of culling on the expected values of 
a —{%—1)/njo, 7. = 4, Gi ed) — 6 /nand Gy = in)g — 0 /N. 
However, these unknown biasses are clearly undesirable. Since the 
method of estimation described in this paper is the maximum likelihood 
method whether or not the culling is random with respect to first 
records, it is to be preferred. If culling is random with respect to first 
records, c can be substituted asc = 1 in the formulae for the asymptotic 
bias and variance of the maximum likelihood estimator and the asymp- 
totic efficiency of the regression method. 

Wadell [1961] has given a method for estimating p when the culling 
is equivalent to truncation of the first lactation yields. His estimate 
of p is a function of 9; , 2 , s°, D{ and the mean squares 2, 3, and 4 of 
Table 2. The estimate has a negligible bias for large values of n but 
its variance is not given and therefore its efficiency is unknown. 


6. A NUMERICAL EXAMPLE 


Dr. A. E. Freeman of the Department of Animal Husbandry, lowa 
State University has kindly made available to me some Complete Herd 
Improvement Registry Records of an Iowa Board of Control herd of 
Holstein-Friesian cattle. The records included some first and second 
lactation yields made in various years but expressed as deviations from 
the herd average for each particular year. The cows were milked 
twice daily and their yields expressed as mature equivalent 305-day 
lactation yields. 

The values of the various relevant statistics for a sample of the 
records were: 
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TABLE 2 
ANALYSIS OF VARIANCE OF First anp Seconp Recorps 
2 r z * eas Ref. 
Source df Sums of Squares Expected Values of Mean Squares Nos. 
j N—2 
Oh) a — po 
ee xh N—1 | 5 (34+ 22. — 21) + Ne 1 
eriods 2 P , ie ed 
se a(N — 1) (p + 2p 1) A, 
Periods SN Saas 
ign. Cows ; N-+n G G2) 
2 (23 + 2212 — 2) + Ns" 
Cows and N 3 
Periods nN 
ste ae 7] 72 UB) 
N+n 
o(1 — p) 
ay. (n+ NY —(N+3n) 2» 
Geekays 2 (32+ 2212 — Zi) + Ns (WW — DN +2) : 
Periods ‘stra ar A ; nN = 7) Gea 
x = SNOT acute pe CY ANG ce \ 2 
Ste Noe Ge 8) aS (G. — Go) 2(N — 1)(N +7) 
n 
Se ty 
ae Aa 
eae L 2 Ch — Ye) a (1 — p) te (2 — Edy + n(1 — p) As 3 
; : OLS De 
Error | Ae = 2212 + 2; a (1 — p) + O(n i), Ay 4 
ze <0) Se es See 
TOTAL |N+n—11 n=, + Ns + Ware @ Yo) | 
= St 
Adj. for 1 (NG + ng)” 
mean N+n 
[re eae es 2 
n 
1 1/N—n7 , a 
Ay = 3 +2p—1)4,+ 5 Coat p + 2p — 1)(D —o'/n) 


A; = 3(1 — p)(D* — a? /n 


N = 220, 
pS ae: 
Ss =1.286:50, 


_2Np (png — @? 'N) 
=i cee (Dy — « /N) 


)p 


N—n 
ae (Con Pitz + M2 — bi F Ph 


) = Ys — 2) D and D=%J—pm. 


n= 150, S = 0.682; 
9. = 1.147, G = —0.207; 
3S? = 270.51; 2. = 125.85 and 2; = 354.98: 
Pee 


The maximum likelihood estimator of 03/07 (3.1) is (3/07) = 1.25. 
This value is almost certainly not significantly greater than 03/0, = 1. 
However, it is sufficiently large to raise some doubts about the assump- 


tion that o3 


o, . The estimate of o3(1 — p’) is greater, but not 
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significantly greater, than the estimate of o? . However, for illustrative 
purposes we shall assume o3 = o7 . 
Equation (2.5) for A is 


p — 0.3922A° + 5.12576 — 2.0728 = 0. 
The simple regression estimator of p is 
b = 21./2; = 0.465. 


Three cycles of an iteration based on formulae (2.8) and with pon a0, 
as the initial solution show that the value of # is, to three decimal 
places, 

6 = 0.404. 


This is the only real root of the maximum likelihood equation for #. 
The two estimators together with their estimated standard errors 
[(2.7) and (4.1)] are therefore: 


b = 0.465 + 0.084 
and 
p = 0.404 + 0.069. 


The latter standard error is an asymptotic standard error. The esti- 
mated efficiency of the regression estimator is only 67 percent and so 
é is probably well worth calculating. The estimated asymptotic bias 
(2.9) is very small, 

Lo 

E(6) = 0.0004. 


The maximum likelihood estimator of o” (2.6) is é? = 314.46. This 
agrees well with the value s’ = 286.55 used above in estimating (6) 
and the standard errors of b and #. 

The estimated value of c is 

b= SIH, = WAU) 
This value of c suggests that there has been very little selection based 
on first lactation yield in this particular sample of records. The 
standardized selection differential 

(Ga 7)/8 = 0184 
suggests that the effective selection intensity P was near 0.93. 


7. SUMMARY 


The maximum likelihood estimation of repeatability from first and 
second lactation records of a herd subject to culling is discussed. The 
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method is applicable only when it can be assumed that, had there been 
no culling, the variances of the first and the second lactation records 
would have been equal, and when all the first records are available 
whether or not the cow has a second record. The efficiency of the more 
usual estimate of repeatability, based on the regression of second records 
on first records, is shown to be low when the repeatability is high. In 
many cases the calculation of the maximum likelihood estimate would 
seem to be well worthwhile. The use of the method in estimating 
heritability is mentioned. An illustrative example is given. 
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THREE CLASSES OF UNIVARIATE DISCRETE 
DISTRIBUTIONS 


C. G. Kuartri ann I. R. Patet! 
M.S. University of Baroda, Baroda, India. 


1. INTRODUCTION 


Families of descrete distributions have been developed and studied 
by many authors, including, Neyman [1939], Feller [1943], Skellam 
[1952], Beall and Rescia [1953} and Gurland [1957, 1958]. These 
families are of three types: 


Type A: gu(z) = exp {h(@}, 
Type B: gas(z) = {h@)}", 
Type C: gc) = c log {h@}, 


where g(z) represents a probability generating function (p.g.f.) and 
h(z) isa p.g.f., except possibly for additive and multiplicative constants. 
The aim of this paper is to set up formulae for certain statistics for 
these types. It is hoped that these will be of use to reseach workers in 
practical fields, who will be formulating compound and generalised 
distributions of these types by using specific forms of h(z). 


2. RECURRENCE RELATIONS FOR PROBABILITIES 


Notations: Let the r-th derivation of f(z) be denoted as f‘”(z). 
Also let 


Peay Orly aos (1) 
and 
= {ni?(2)/r}} |-=o : (2) 


By the definition of g(z), it is clear that P, denotes the probability 
of the r-th count in g(z) and z, , the probability of the r-th count, 
excepting possibly for additive and multiplicative constants. 

Type A: Here 


ga(2) = exp {h@)}. 


1 Present address: Statistical Officer (I), Rajkot, India. 
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Successive differentiation leads to 


Oa Ss ie is a @al PO). (3) 


k=1 


Hence, on letting z = 0, we have 
Pes Ds kr,P,-./r with Po = exp (mo). (4) 
k=1 


Type B: Here 
gunz) = {h@)}”. 


Successive differentiation of h(z)gQ@(z) = ngs(z)h’(z) leads to 


2 Go. Oly ("= hese. © 


Hence, on letting z = 0, we have 
d age = 2 (nk rege ee =— k)r.P,-+/?To with Ie%. = To ‘ (6) 
k=1 


Type C: Here 
go(2) = c log {h(z)} where c = log {h(1)}’. 


Successive differentiation of h(z)g‘?’ (2) = ch“ (z) leads to 


nagr@ = ar@ — S(" > WnM@aePe. (7) 


k=1 
Hence, on letting z = 0, we have 


ai 
P, = {rer, — 2 (7 — k)n,P,2\/rxo with Po = c log a. (8) 


3. FACTORIAL CUMULANTS 
Notations: Let 


wiry = {h°? @} |e, (9) 

Mtr =F {g°(2)} z=1 ) (10) 
and 

Kix, = {(d/dz)’ log g@} |.-1 . (11) 


From the definition of factorial cumulants, it is clear that y/,, is the 
r-th factorial moment of h(z) if h(z) is a p.gf. and M‘,, and K,,, are 
respectively r-th factorial moment and r-th factorial cumulant of g(z).. 
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Type A: Here log {ga(z)} = A(z) and hence, 
Kir = Bir (12) 
Type B: Here 
o(2) = log gz(2) = n log hfe). 
Hence, on using (11), it is clear that r-th factorial cumulant of Jz (2) 


is n times the r-th factorial cumulant of h(z) if h(z) is a p.gf. Also on 
using (7) (with necessary modifications), (9) and (11), we have 


, F = (r—-1 a‘ 
Kin mee (2) OF | Sl ik MK (rx) ‘ (13) 


Type C: Here, it is easy to give a recurrence relation for factorial 
moments J/{,; rather than factorial cumulants. By using (7), (9) and 
(10), this relation can be shown to be 


a Sel 

Mi, = on = = k Jen Mnf (14) 
Pail 

where uf = A(1). The factorial cumulants can be obtained from (14) 

by using the relations 


Kg = iy, Gry ie Mii ) 
Ki 3) = Mr = 3M 1.,M is, + 2M? ) etc. 


4. SPECIAL CASES 


Notation: If a variate has either a Binomial or a Negative Binomial 
law as a special case, we say that it has a general binomial law. 

Type A: From the form of the Poisson-Binomial, Negative-Bi- 
nomial, generalised Polya Aeppli, Beall and Rescia [1953] and Neyman’s 
Types A, B, and C, it is clear that they belong to this form. It is to be 
noted, that, if in the classical problem of egg masses and larvae, the 
egg masses have a Poisson distribution with p.g.f. exp [A(z — 1)] and the 
- larvae within an egg mass, a distribution with p.g.f. w(z), then the 
distribution of the larvae over the whole field is exp [A{w(z) — 1}] 
which is of Type A with h(z) = A{w(z) — 1}. 

Some important distributions with their recurrence relations are: 

(i) Poisson-Hypergeometric distribution [1958]: Here 


wZ) = D (*eam'C DCE ea oHies) (15) 


r= 


where 


ORG ea ala ie I)(a si 2) ate (a ae eas iy (") = oak 
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ee = k(k = 1) vege (k = 7 + L);, CLE) = L, Ke = 1 and a, B, m 


and k are such that w‘”(0)/r! = 7, 1s positive. The above distribution 
was first given by Gurland [1958]. The recurrence relation in prob- 
abilities is 


Py X >) smP yr (16) 
s=1 


where Py = exp {A(ro — 1)}, and 
atBtr—2—ma—k + 2r — 3) 
te r(l1 — m) Recs 


lee Sher 2m 
rir — 1)(1 — m) 


Wr-2 4 (17) 


dio 


Ee De Hay(—B)cay'/slla 1) cog Gane ata as 


For the particular cases of the above distribution, we may refer to 
Gurland [1958]. The recurrence relation for factorial cumulants is 


Kir = Aa yk m'/(a + B) cr) . (18) 
(ii) Poisson-Power series distribution: 
we) = Diaz’, (19) 
i=0 


where a,’s are constants such that w(z) is convergent for some z. The 
recurrence relation for probabilities is 


P, = >> sa.P,-./r with Po = exp {A(a — )}. (20) 


The above distribution was first given by Maritz [1952]. 

Type B: Let the distribution of egg masses be a general binomial 
with p.gf. (1 — p + pz)". Then the distribution of the larvae over 
the whole field is [1 — p + pw(e)]", ie. the Type B distribution with 
h(z) = {1 — p + pw(z)}. When n = 1, this leads to the distribution 
with p.g.f. w(z) with an addition (or subtraction) of zeros, If ws 1 
or n < 0, this can be regarded as n-th confluent of w(z). 

Some important distributions with their recurrence relations are: 

(i) G. Binomial-Hypergeometric distribution: Here w(z) is the same 
as given in (15). This was first stated by Gurland [1958] in the special 
case when n < 0, p < 0. The recurrence relation in probabilities is 


Po= pn Dz: (ns — 7 + 8)1,P,-./TA , (2H) 
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where Po = ah , & = 1 — p + pm and 7z,’s are defined in (17). 

(ii) G. Binomial-G. Binomial distribution: Here w(z) =(1—m-+ mz)’. 
Hence, 7; = m(k — i + 1)a,1/i(11 — m), with m = (1 — m)‘, 
a =1—p-+ pr and Py = ai. The recurrence formula has the same 
form as in (21). 

(ili) G. Binomial-Poisson distribution: Here w(z) = exp {A(z — 1)}. 
Hence, 7; = ‘mo , T. = exp (—A), & = 1 — p+ pm and Po = ay. 
The recurrence formula has the same form as in (21). Whenn = 1 and 
p{1 — exp (—d)} = 4, this was called by A. C. Cohen [1960] an ex- 
tension of a truncated Poisson distribution. 

Type C: Let the distribution of egg masses be a logarithmic dis- 
tribution with p.g.f. log (1 — \z)/log (1 — d). Then the distribution of a 
larva over the whole field will be log {1 — Aw(z)}/log (1 — A), 1e. the 
Type C distribution with h(z) = 1 — dw(z). The important distribu- 
tions are obtained by considering w(z) as hypergeometric function 
G. Binomial, Power-series etc. 


Choice of a distribution under the condition of no migration: 


Let us suppose that the different sites of a colony are distinct and 
countable, and let there be no migration between sites of a colony. 
Assuming the same probabilities of arriving at a particular site by any 
organism, the distribution of r (when r is fixed) organisms in a particular 
site is Binomial. Now let us suppose that one or more organisms 
arriving at the colony follow the truncated Negative Binomial law. 
(This may be true under the wide applicability of the Negative Binomial 
in biological data, e.g., Bliss [1953], Evans [1953]). Then, it is easy to 
show that the p.g-f. of the organisms in a particular site (when there is 
no migration) is 


1 6 elm, — ma — + m/l — Cam) 


here 70pm Oy 0s ue, bp pl my = mz)" 
forrO0 <p < {1—(U+m)"}",m > Oand k, > 0. Now suppose 
that the independent results of n sites are combined together. Then 
the p.g.f. of the distribution of organisms is 


Eee By ita Dp lacks mi mz) “*]” 


which is a particular case of G. Binomial-G. Binomial. The above 
distribution can be named as Binomial-Negative Binomial. 


5. METHODS OF ESTIMATION FOR G. BINOMIAL-G. BINOMIAL 


Here the p.gf. is [1 — p + p(l — m+ mz)". 
Method of moments: Let T = ay Ka, ee Keane. 
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S = K?,,/Ki2;. The approximate value of 7 is obtained from 

n(TS — 2R? + R) + nS(TS — 6R + 6) + S(R — 2) = 0. 
Then k, m and p are estimated from 

k= {S? + nS +72 — R)}/n{n(l — R) — S$}, 
(nK 2) + Kiy)/(k — InKy, and p = Ky,/knm. (22) 


m 


Method of maximum likelihood when n and k are fixed: 


Similar to Sprott’s results [1958], we have the maximum-likelihood 
equations as 


F=nkpm and L(m) = 2a,G(r) —-N =0 (23) 


with L'(m) = (d/dm)L(m) = DS a,G(r)Pr*{1 — kd — py} - 
{1 + (1 — m)[mk(1 — p)]'h "FA G(r)}], where 7 is the sample mean, 
a, is the frequency at the r-th count, N is the total frequency, 
G(r) = (ry + 1)P,4:/P,7, A G(r) = Gr + 1) — G(r) and m, 6 are 
maximum likelihood estimates. 

From the first approximate estimates #’, ’, the new corrected 
values j’’, 7’’ are estimated from 


ml! = a! — {L(h’)/L'(h’)} and fp" = F/nkm’’. (24) 


Sample zero frequency when n and k are fixed: 


Here the estimates p and m are obtained from 
a = N[l —p+p(_ — m)‘J* and * = nkpm. (25) 


It may be noted that when n = 1, the two equations in (25) are the 
same as those in (23). 


6. EXAMPLE 


In order to illustrate how the above discussion can help an experi- 
menter in the field of curve fitting, we fit here Binomial-Negative Bi- 
nomial to the data in Distribution 1 of MacGuire et al. [1957, Appendix]. 
From the data, the first four factorial cumulants are Ky, = 2.5900156, 
Ki; = 0.6877630, Kis; = 0.0218497, K,4, = 0.9080044. 

Hence, on taking n = 1, the solution of k, by the method of moments, 
correct to first decimal place is —12.0. Then the various estimates 
of mand p are m = —0.2204963, p = 0.9788582 by the method of zero- 
cell frequency or maximum-likelihood and m = —0.2196584, p = 
0.982591 by the method of moments. 
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Die 


The fits are shown in Table 1. The fits of other distributions are 
given along side for reference purposes only. 


TABLE 1. 

Binomial-Neg. Binomial 
Count Obs. Negative | Poisson; | Poisson 
per fre- by M.L. or | by method | Binomial | Binomial | Power 
plot quency zero fr. of moments [1957] {1957] Series’ 
0 355 355.000 346.445 324.30 341.84 | 339.072 
1 600 622.478 628.153 660.37 644.37 | 645.036 
2 78i =| =730.994 735.340 734.06 728.03 | 730.150 
3 567 616.306 618 .023 610.45 609.14 | 610.886 
4 441 417.545 417.393 408 . 82 415.60 | 416.079 
5 245 241.296 240.550 236 .54 242.72 | 242.339 
6 135 123 .567 122.747 122.49 125.17 124.532 
fii 42 57 .406 56.846 58.12 58 .20 57.655 
8 17 | 24.632 24.315 25.68 24.76 24.417 
9 | 11 9.395 Sy, Ue 10.70 9.75 9.567 
10? i 7.301 5.457 4.47 5.42 5.267 
x2 with 19.184 20.294 34.52 25,02 25.939 
videE: hiadte 7 sGlate 8.df. 8.d4 Sadat 


2Expected frequencies are from 10 and above 
3Probabilities are calculated from the p.g.f. exp { —(a +b) +az + bz} where a = 1.9022526 and 
b = 0.3438815. 


7. ASYMPTOTIC EFFICIENCIES 


Here we give the asymptotic efficiencies for the various methods 
of estimation for m and p only ina G. Binomial-G. Binomial distribution. 

The determinant of the information matrix up to order N* for the 
maximum likelihood estimates m and p is 


Dn.p = N’n*[npk{kd — p) + — m)m"}R 
aw) = Las 


where R = —1+4+ >> G’(r)P, and G(r) = (ry + 1)P,4:/nkpmP, . 
The determinant of the covariance matrix up to order N~* for the 
moment estimates of m and 7p is 


D = (KK, + 2K2 — K3)/[Nk(k — 1)nmK,}’, (27) 


(26) 


where 


Ki = kn mp, K./K, = 1+ (k — 1)m — (K/n), 
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K,/K, = 14+ 38h — m+ (&— DE 2)m? — 3(K2/n) — (Ki/n’) 
and 
K,/K, =1+ 7k — lm + 6 — Ik — 2)m’” 
+ (k — I)(k — 2)(k — 3)m* — 4(K;/n) — 6(K,K./n’) 
— (Kj/n*) — 3(K2/nK,). 
The asymptotic efficiency in the restricted sense is 
B= Dep (28) 


The determinant of the covariance matrix up to order NV ~? for the 
sample zero frequency estimates of m and p is 
D(m, p) = maf{a(1 — Po) — PoKi}/N’an’KiPo ; (29) 
where @ = 1—-p + pil — ma = K/K, Bo — a, and 
@, = 1— (1 — m)* — km(1 — m)*"*. The asymptotic efficiency with 
respect to the method of moments is 
E, = D/D(m, p). (30) 


It may be noted that H, = Ey’ whenn = 1. 


TABLE 2 
Asymprotic Erriciency Z, RELATIVE TO THE Mrtuop or MoMENTS 
WHEN n = Llanpdk = —1. 
m 

p{l —(1 — m)*¥} = 0 —0.5 —-1 —2 
0.3 1.492 1.857 ~ 2.193 
0.6 il Keulal 74045) 2.630 
0.9 2.444 4.000 4.944 
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FURTHER CONSIDERATION OF METHODOLOGY 
IN STUDIES OF PAIN RELIEF 
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AND 
Spencer M. FRx, JR. 


Smith, Kline, and French Laboratories, 
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INTRODUCTION 


In an earlier paper [1] we challenged the prevalent view that in 
comparisons of pain relieving drugs it is always desirable to have “each 
patient act as his own control,” i.e., to test more than one drug on each 
patient and to estimate treatment contrasts from within-patient differ- 
ences. We sought data of other investigators to compare with our own, 
but we were unable to find references which gave sufficient detail to 
permit investigation of the merits of alternative designs and analyses. 
It seemed desirable, therefore, to put our own data on record so as to 
facilitate discussion of the issues raised. This we did, and the response 
has been gratifying. 

This paper presents and discusses some of the suggestions and 
comments of those who wrote to us. We are particularly indebted to 
Frederick Mosteller, Department of Statistics, Harvard University, and 
to Robert Curnow, A. R. C. Unit of Statistics, University of Aberdeen, 
Aberdeen, Scotland, who, in addition to making numerous thoughtful 
comments, performed their own analyses of our data. 


A SIMPLE EXTENSION OF THE MODEL 


As stated in our earlier paper, this study of relief of post-operative 
pain was designed as an incomplete block experiment. Each patient 
was given a dose of a test drug, and the dose was repeated when the 
patient again complained of severe pain. The number of hours of 
“oreater than 50 percent pain relief’? cumulated over both intervals 
was the measure of drug efficacy. A second drug was tested on each 
patient in the same way, starting at the time when the patient again 
complained of severe pain. Three drugs, two levels of a new drug and 
one level of Demerol, were under study. 
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It was found that the total hours of relief achieved when a given 
drug was the second administered was larger on the average than when 
that drug was administered first. This is consistent with the presump- 
tion that the pain decreases with time after operation. However, it 
invalidates a straightforward least squares analysis based on the model 


hours of relief = grand mean + patient effect 
+ drug effect + random error. 


We considered the possibility of extending the analysis to include a 
simple time-period effect, representing the average differences in relief 
between the second and first periods, but for two reasons we did not 
pursue it. Firstly, and most importantly, it seemed to us that such a 
model would be unlikely to be an accurate reflection of the true situation. 
For example, if the duration of drug effect increases with time after 
operation, the period effect should be greater when the second drug is 
given after a potent drug than when it is given after a weak drug. 
Secondly, our object in the first paper was to compare simple alter- 
natives for the design and analysis of studies to evaluate analgesics, 
and a procedure requiring adjustment for time period as well as patient 
effects did not seem attractive as a procedure to recommend to 
clinicians for routine screening of new drugs. (In this connection one 
should remember that it is almost always necessary to discontinue study 
of some patients, often for reasons unrelated to drug response. Thus, 
balance in design is hardly ever achieved.) 

Curnow extended the least squares analysis of our data to take 
account of such a time-period effect. The model becomes 


hours of relief = grand mean + patient effect + drug effect 
+ time period effect + random error 


and the corresponding analysis is given in Table I. He stated that the 
differences due to the drug which follows a potent one being administered 
rather later than one which follows a weak one were small and had little 
effect on the estimates of drug differences. Thus the internal evidence 
of the experiment itself does not seem to confirm the fears expressed 
above, and the least squares analysis that includes a time-period effect 
appears to yield a reasonable description of the data. In particular, 
in both the inter- and intra-block analyses, the between-group residual 
mean square is now not larger than the corresponding within-groups 
mean square. 
Curnow went on to point out that the inter-block error is not much 
larger than the intra-block error, and that one might on this ground 
justify an analysis ignoring patient differences, as shown in Table II. 
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TABLE I 
Least Squares InTRA-BLOCK ANALYSIS 
Source D.-F. 5.8. M.S 
Blocks 42 607 .256 
Periods and Drugs 3 378.015 
Periods ignoring drugs 1 261.628 
Drugs adjusted for periods 2 116.387 58.194 
Between Groups Residual 3 3.051 1.017 
Within Groups Residual 37 428 .945 11.593 
Total 85 1417 .256 
DruG CoMPARISONS 
Comparison Mean Difference Variance t 
D vs. Ti 3.421 1.178 3) 5 115) 
D vs. Ts 2.037 1.061 1.98 
T3 vs. Ti 1.384 1.025 135 
TABLE II 
Least Squares ANaLysis IgNnorRING PATIENT DIFFERENCES 
Source D.F. 8.8. M.S. 
Periods and Drugs 3 434.368 
Periods ignoring drugs 1 261.588 
Drugs adjusted for periods 2 172.780 86.390 
Remainder 82 982.888 11.986 
Total 85 1417 .256 
Druc CoMPARISONS 
Comparison Mean Difference Variance t 
D vs. T1 3.548 0.874 3.80 
D vs. T3 1.861 0.831 2.04 
T; vs. Ti 1.687 0.816 1.87 
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This observation suggests that in future experiments conducted under 
similar conditions we should not arrange matters to eliminate patient 
effects, if it will cost us much to do so. 

Our own calculations and scatter diagrams confirm Curnow’s re- 
sults. However, in view of our first argument against the use of a 
simple time-period effect we found them surprising and made some 
further investigation. We take note first that the fact that a model 
“‘fits’’, in the sense that certain internal checks are satisfied, does not 
insure its correctness. If, for example, second-dose relief were more 
variable than first-dose relief, an examination of residuals would not in 
this case establish that fact; the estimated first-dose residual for a 
given patient has precisely the same magnitude as the estimated second- 
dose residual for that patient when we use the present model. The 
second-dose relief scores are, in fact, more variable than are those for 
first doses. This may be observed by examination of Table 1b in [1] 
and it is reflected in the fact that the within patient residual variance 
(Table 3a), which excludes the component of variability due to patient 
differences, is actually larger than the between patient variance for first 
dose relief (Table 2). 

If the facts are in accord with the assumption that the increment 
in relief with increasing time after administration is proportional to time 
after operation, the effect of the model having only a simple time-period 
effect would be to assign a part of the drug difference to the time-period 
effect and thus to reduce the measured differences between drugs, as 
compared to an analysis for the first period only. In fact, the difference 
between the most and least potent drugs (D and 7) estimated by use 
of the extended model is less by about 15 percent than the estimate 
obtained from analysis of the first period only. 

Granted, then, that the model with a simple time-period effect 
fits the data insofar as intra-block checks are concerned, what may we 
expect to gain or lose if we go ahead and use it? We do have some 
evidence which tends to contradict the assumptions of this model, but 
the effects do not seem large. We have some slight evidence that we 
may lose discriminating power by misinterpreting drug differences as 
time-period effects, but, as judged from these data, the reduction in 
variance may well compensate. Insofar as testing (i.e., deciding which 
drug is better) is concerned, or estimating potency in an experiment 
with a full range of standards, the analysis based on this model should 
- give essentially valid results. 

In any event, apart from the question of the best analysis for these 
data, the smallness of the between patients component in the extended 
analysis suggests that we would have achieved at least equal precision 
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for drug comparisons if we had kept each patient on a single drug 
throughout the experimental period. Had we done so, we would have 
had, in addition to possible benefits in precision, the comfort of closely 
imitating the ordinary clinical situation, and the advantage of an un- 
biased estimate of a clearly interpretable measure of effectiveness along 
with an unbiased estimate of its variance. We might also have had the 
cooperation of several more surgeons, some of whom, understandably, 
object to studies which require administration of several coded drugs 
to each patient. 


A REGRESSION MODEL 


In furtherance of the point of view that the cost of statistical work 
is generally small compared to the cost of gathering clinical data, 
Mosteller suggested that the data be analyzed in accordance with a 
model which might give a fairly realistic appraisal of the time-trend 
effect. In particular, he suggested that we not cumulate the two doses 
of each drug but analyze the data for single doses according to the 
following model. 


Yimad = 1oeR = (6. = Ven lama = €imd ) 


where 
a; = effect of z-th subject, 
Bm = effect of medication m when given at time zero, 
Ym = regression of effect of medication m on time of administration, 
tima = time at which individual 7 receives the d-th dose of medica- 
tion ™, 
€jmq = Yandom error. 


This analysis was carried out, using the times of drug administration 
(not shown, but available upon request) in addition to the data already 
presented. It was found that the regression coefficients y,, were quite 
close to one another and, if we restrict the model to the case of equal 
Ym , the least squares estimate of this common value is 0.20, correspond- 
ing to an additional hour of drug relief for every five hour interval 
between operation and drug administration. Using this restricted model 
and doubling the estimated effects to make these results comparable 
to those for our other analyses, we have the estimates of treatment 
differences shown in Table ILI. 

Assuming for now the correctness of the regression model, we see 
that these estimates are quite close to those obtained in the analysis 
using only the first drug for each patient—Table 2 of [1]. However, 
the variances are reduced by about one-third, so that the regression 
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TABLE III 
Drua Comparisons rrom REGRESSION Moprt ANALYSIS 


Comparison Mean Difference Variance t 
DN Stele 4.066 0.824 4.48 
D vs. T3 2.530 0.732 2.96 
T3 vs. Ty 1.534 0.708 1.82 


analysis, using all the data, appears to have about 50 percent greater 
precision than the “first drug only’ analysis, which uses only half 
the data. 

Comparing the apparent precisions provided by the above analyses, 
we see that Curnow’s intra-block analysis, adjusting for patients and - 
time periods (Table I) appears comparable in precision to the “first 
drug only” analysis. Curnow’s analysis ignoring patient effects (Table 
II) appears almost equal in precision to the analysis based on the 
regression model. 


OTHER COMMENTS AND ERRATA 


A third correspondent, Irwin Bross, Roswell Park Memorial Insti- 
tute, Roswell Park, New York, raised several points, in part overlapping 
those above. In addition he pointed out that the results of the “least 
squares” analysis (‘Table 4a in [1]) differ appreciably from the analysis 
based on simple linear combinations of intra-individual contrasts. 
Since the design is nearly balanced, closer agreement might be expected, 
even though our “least squares’ analysis fails to allow for time-period 
effects. In fact, contrary to the suggestion in [1], the “linear combina- 
tions” analysis is not really comparable to the “least squares” analysis. 
The reason is that in estimating, say, D — 7, , no account was taken 
of the information about this difference given by contrasting the direct 
estimate of D — TJ, with the estimate of 7, — T;. If we take an 
average of the directly observed difference, D — 7, , with the contrast 
obtained by combining D — 7; with 7, — 7; , weighting inversely as 
the estimated variances, and proceed similarly for the other compari- 
sions, we get Table IV, which agrees much more closely with the “least 
squares” analysis (Table 4a in [1]). As might be expected, the ‘‘ex- 
tended least squares’’ analysis which does allow for time-period effects 

‘(Table I) gives results in excellent agreement with Table IV. 

This revision leads in turn to a more precise combined intra- and 
inter-block analysis in place of that shown in Table 8c in [1]. The 
result (not given) is quite close to the “least squares’ combined analysis. 
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TABLE IV 
Druc CoMPARISONS FROM ANALYSIS OF WITHIN PATIENT CONTRASTS 


Comparison Mean Difference Variance t 
D vs. T; 3.449 1.206 3.18 
D vs. T; 2.051 1.073 1.98 
TES VS eels 1.398 12032 1.38 


Both Curnow and Bross pointed out that erroneous entries are 
shown in Table 4c in [1]. This table should be replaced with Table 4c 
(revised) as shown. 


TABLE 4c (revised) 
ComBINED INTRA- AND INTER-BLock ANALYSIS FOR DRuG DIFFERENCES 


1 ee ae 
w —— = 0.8518 Oy = 3720 > 


= 2 
1174 0.2688 


Calculation of Average Mean Difference: Drug D vs. Drug 7: 
(w)(3.150) + (w’)(3.900) 


=) 5.000. 
w+ w : 
Calculation of Variance of Difference 
1 il 
———— = ———_,,, = 0.892. 
w+ w’ 0.8518 + 0.2688 
Drug CoMPaARISONS 
Comparison Mean Difference Variance t 
IDY Sa UE 3.330 0.892 8) iv 
D vs. T3 1.843 0.848 2.00 
Secs vs. T, 1.442 0.828 Watts 


DISCUSSION 


The point which we wished to emphasize in [1] was that on account 
of the increase in duration of relief with time of administration, the use 
of simple intra-patient contrasts—‘each patient his own control’’—may 
not be optimal, and that an experiment in which each patient received 
only a single drug might be preferable. 
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The analyses proposed by Curnow and Mosteller have demonstrated 
that for this data a model taking account explicitly of the effect of time 
of drug administration will yield more efficient estimates than does the 
one-way analysis of the first period only. With respect to the analysis 
of the present data, both the Curnow and Mosteller models appear to 
fit quite well, and the estimates of the drug effects produced by their 
analyses are more precise than those derived from the analysis for the 
first period only. 

It is not clear, however, whether the above finding should be con- 
strued as evidence supporting the need for more complex models and 
analyses than are currently in vogue, or whether instead it is evidence 
supporting our original viewpoint. If, as the evidence of Curnow’s 
analysis suggests, patient differences are negligible, we would have no 
reason at all to use each patient as his own control. Were each patient 
given only one drug, the time-period effect of Curnow’s model would be 
eliminated from treatment contrasts and the one-way analysis—using 
now the whole of the data—should be fully efficient. 

Even if patient differences are not negligible, the panties of 
simplicity, both of design and analysis, may outweigh a small gain in 
efficiency which could be achieved with the use of a more complex 
design and analysis. With our data, for example, the estimates of 
greatest apparent precision were those provided by the regression 
model, and the gain in precision compared to the analysis of the first 
period only was 50 percent. Such a gain is, of course, quite worth 
having, but it is not overwhelming, and it is easy to believe that had 
our design assigned one drug to each patient throughout the period of 
study, the one-way analysis might have equalled the present regression 
analysis in precision. 

In conclusion, we must again emphasize that we do not claim that 
our findings apply to all kinds of pain studies. There may be many 
situations for which within-patient contrasts are far superior to between- 
patient contrasts. What we do claim is that no principal such as 
“each patient his own control’ is entitled to the status of dogma. 
In some situations, at least, the simpler methods are better. 
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SUMMARY 


This paper discusses the interpretation of frequency data when the 
series of observed frequencies is assumed to arise from a population in 
which the expected frequencies form a geometric progression. Such 
situations occur in the study of steadily increasing insect populations 
and similar phenomena, where the common ratio of frequencies is related 
to the rate of growth of the population. 

The estimation of the common ratio, and tests for the significance of 
departure from geometric trend, are discussed. Asymptotic formulae 
for the tail probabilities in large samples are determined. 

The methods and tests of significance are illustrated by application 
to some experimental data. 


I. INTRODUCTION 


Observations are often recorded by classifying them into several 
classes and counting the frequencies of occurrence in each class. In 
general, the expected frequencies will be partly specified by theoretical 
considerations, but will often also depend on one or more unknown 
parameters. 

The interpretation of such data then involves, firstly, testing its 
concordance with the assumed form of expected frequencies, and 
secondly, estimating the unknown parameters. Thus, for instance, in a 
steadily increasing population of organisms, the number of individuals 
expected in successive age-groups will be in geometric progression, the 
common ratio between the expected numbers representing not only the 
growth of the population but also the effects of mortality, migration 
and other factors. In such a study, one of the objects would be to test 
the concordance with the assumed geometric progression, and the other 
would be to estimate the common ratio accurately. 

In general problems of this kind, the parameters may be estimated, 
and the accuracy of the estimates assessed, by the method of maximum 
likelihood. To test the fit of the model to the data, the x” test is generally 
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appropriate. If hypothetical values of the parameters have been 
specified, as occurs in many practical instances, their concordance with 
the estimates derived from the data can also be tested by means of x’. 

In the particular case, with which this paper deals, of expected 
frequencies in geometric progression, there exist sufficient statistics for 
the two parameters involved, representing the general level of the 
frequencies and their geometric trend. This facilitates estimation and 
significance testing, although, since the equations of estimation are in 
general non-linear, iterative methods are required in the arithmetical 
work of calculating estimates. 

The need to fit frequencies in this way arose from a method, devised 
by Dr. R. D. Hughes, Division of Entomology, C.8.1.R.0., of using 
the age distribution in an aphid population to study its rate of growth 
in the field. The device of using the proportions of individuals in the 
immature instars in a field population of aphids to give, firstly, the age 
distribution, and secondly, the rate of increase of the population, is 
the subject of a forthcoming paper by Hughes. In the present paper, 
some preliminary observations of instar distribution under controlled 
experimental conditions are discussed. 

Chapman and Robson [1960] have considered the age distribution in 
a stationary population subject to constant mortality. As this formula- 
tion leads to a geometric progression of expected frequencies, many of 
their results anticipate results given in this paper. However, the 
general objects and scope of the two papers are different. 

Aphid populations under uniform and favourable conditions increase 
at a constant rate, so that their expected numbers at equal intervals 
of time form a geometric progression. However, the numbers even of 
an initially small population quickly become too large to be counted 
accurately. An estimate of the rate of growth of the population may 
then be made from the age distribution. The immature aphid passes 
through four instars before reaching maturity. In a stationary popula- 
tion, the number expected in each instar is proportional to its duration. 
For some species of aphids, for instance A. craccivora, the average 
durations of the first three instars are probably equal under constant 
conditions, so that equal numbers will be expected in each instar in a 
stationary population. When the population is increasing at a constant 
rate, the numbers in each instar will approximate to a geometric pro- 
gression. 

If the common duration of each of the first three instars is c, and 
the growth-rate of the population is p, then the ratio of the number in 
each instar to that in the preceding one is approximately 


—cp 


é 
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In this result, mortality has been neglected, since it has been shown 
that under favourable conditions mortality in the immature stages is 
negligible. 

By taking a random sample from the aphid population and de- 
termining the number in each instar we can first check the validity of 
the assumption of uniform growth by testing whether the numbers are 
consistent with a geometric progression. Having satisfied ourselves 
on this point, we can then estimate the common ratio of the numbers. 
If c, the duration of each instar, is known, the growth-rate can then 
be determined. 

If the durations of the different instars are different, the above 
method will be modified, and the estimation of the growth-rate is a 
little more complicated; in principle, however, such data will still 
provide information from which the erowth-rate can be determined. 


II. THE BASIC DISTRIBUTION 


We consider a sample of N individuals classified into k classes, the 
observed frequency in class 7 being n; , with expected value 
Ea) =e (OL ey eee 
If N is assumed to have a Poisson distribution, so has each of the n, ; 
the joint probability density is then 
gS Nei 


: Tec) 


ar = Auf) ne fmt fy 1) = 
P(r ,™m ; , M-1) Ile \ he /n; ) fie bitin SOO pry, i 


where 


fl 


Ny =F 2Ne 2: + (k — Wn ; 
and 
o(u) =ltet--- te. 


From the form of the density it is apparent that N and T are a pair 
of sufficient statistics for the parameters \ and wp. Thus, questions of 
estimation may be referred to the joint distribution of N and 7. 

By summing the probability (1) over values of the n, leading to the 
given values of N and 7’, we find the joint distribution of V and T as 


P(N, T) = oN 2 Ne (2) 


where C(N, 7) is a numerical coefficient. 
The number WN has a Poisson distribution with mean Ad¢o(u); 


P(N) = &* [Ago(u)]"/N!. 
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Hence for given N, the conditional density of 7’, which is independent 
of X, is 


PT | N) = CW, T)u"/Ieo(u)]”. 


This is a special case of the distribution discussed by Noack [1950] and 
described by him as a power-series distribution. Our [o(u)|" takes 
the place of his f(z). From this expression for the density we see that 
C(N, T) is the coefficient of 1” in g(u)”. This fact enables the numerical 
coefficient to be determined directly. 

This conditional density of T provides the basis for estimates and 
tests of significance about u. 


II. THE COMBINATORIAL FACTOR 


The coefficient C(N, T) is seen to be a generalization of the binomial 
coefficient (for the positive binomial, when k = 2; for the negative 
binomial, when k = ~). It is the number of ways of allocating T like 
objects among WN different cells, none of which may contain more than 
k — 1 objects (see Riordan [1958], page 104, where some recurrence 
relations and moment formulae are also given). 

The coefficients are generated by the function 


wior = (te) = [ECE LZ C-or] 


On equating coefficients of u” we find 


Wen Ngee lh eta, 
110) War re gee 0 


The series has 1 + [7'/k] terms. This expansion in terms of binomial 
coefficients is useful, especially if k is large, since the first few terms 
then give a close approximation. 

Because of symmetry, C(N, T) = CIN, (k — 1)N — T] so that 
results for T > 4(k — 1)N can be found from those for T < 3(k — 1)N. 

From the generating function or otherwise we may also deduce the 
recurrence relation 

CON a= CNiet — hy) CON er) — CON — Laer) 
which is useful for computation. 

Hitherto we have been considering relations among the C(N, T) 
for a fixed value of k&. We now consider relations for different k, and 
shall indicate by a suffix the number of classes. 
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We may express the coefficients for k classes in terms of the co- 
efficients corresponding to the factors of k, by means of the following 
reduction formula. 

Ii k = fg, 


iii) oe 
on | = u pee Ae of Og ° 


Hence 
CUNT) = CIN, T) CN = ee 
tr C(N,T oa 2f/C(N, 2) ae ) 


a series of 1 + [7'/f] terms. The terms of these series are all positive, 
and generally smaller than those of the series (3) of products of binomial 
coefficients. They may have some advantages for computation, pro- 
vided the coefficients for f and g classes are known. 

By differentiating the generating function with respect to 1ts param- 
eter we may prove recurrence formulae of the type 


ON a= a Sil (4) 
Ih = DON A151) +e = 2)CWN al) 
ee OGN TN ipetee hy, 
When k& = 3, many particularly simple recurrence formulae may be 
established. Recurrence relations when k = 3 are 
SOL Nee Aone T) 


[ON = DOW = 1-7 — Dose = Ceo re eo 


1 
= oF (ON —T HCN TS) oN CN = he 2 


eet ie ee OT, 


the last being a particular case of (4). As & increases the recurrence 
formulae become more complicated. 


IV. MOMENTS OF THE CONDITIONAL DISTRIBUTION OF T 


The moments and cumulants of the conditional distribution of T 
are of general interest, and will also be of use when the determination 
of probabilities in the tails of the distribution is being considered. 
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Since the cumulants for a sample of N are simply N times those for a 

sample of 1, we shall consider a sample of 1, for which 7’ equals X, the 

number of the class (0, 1, --- , k — 1) in which the observation falls. 
Clearly, if 


‘ a PNG k-1 
$,(u) = (. 2) do(u) = > CATES 


then the rth moment about zero is $,(u)/¢o(u). This result is equivalent 
to the results of Noack [1950], though expressed in a slightly different 
manner. 

However, unless k& is small, these expressions are not convenient 
for the computation of the central moments and cumulants, which are 
more simply found directly, 

The moment-generating function of X is 


Ee") = Do ure’*/ >) u® = go(ue’)/bo(u). 


Thus the cumulant-generating function is 
K(s) = log do(ue’) — log go(u) 
= —log (1 — ne’) + log 1 — u*e") + log (1 — ») — log (1 — 2’) 
Tr rk 
eae, 


From this expansion we derive the particular results 


ae wae ee kes rm 
pam (GE eee OM aril 
ee a Dye eet  Cie o ue) 

(1 ‘3 yu)” (1 = y')? ) 
ee Bh aw th Mut + dy + uM) 
; Coon Coy 


The form of the cumulant-generating function shows that, if u, is a 
geometric variable with parameter yu", then X + ku, = uw. It also 
follows that, if u,(N) is a negative binomial variable with parameter py“ 
and index N, then T + ku,(N) = w(N). 

When uw = 1, the distribution is the uniform discrete distribution, 
and the cumulants are expressible in terms of Bernoulli’s numbers. 
We then have 
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shee 
) + 18 (-) 


eae 2 B, 
= ayes + (k 1) 3 
where s/(e’ — 1) = 1+ >> B,s'/r! = e”* in symbolic form. Hence 
x, = (k” — 1)B,/r; in particular, mm = (k — 1)/2, ke = (k? — 1)/12, 
kz = 0, ke = —(k* — 1)/120. 


K(s) = —log (¢ = : 


2 4 
8 ae 4) ose 


Relation between Cumulants Corresponding to » and furs 


When the series of frequencies is reversed, the ratio is replaced 
by its reciprocal. It therefore follows that, when u is replaced by nee 
the odd cumulants other than x, are simply changed in sign, and the 


even cumulants are unaffected. We have 
KiGie) =k — lee (a), 
eu) = CLK) Perel 


These results may also be readily verified from the form of the cumulant- 
generating function. 

Because of these facts, we need not consider values of » outside the 
range (0, 1). 


V. ESTIMATION OF THE RATIO OF FREQUENCIES 


In the conditional distribution, 7 is a sufficient statistic for the 
parameter py, the common ratio of the expected ferquencies. Thus the 
estimation of pu is straightforward. However, since the equation of 
estimation by the method of maximum likelihood is non-linear, iterative 
methods will be required to solve it. 

The logarithm of the conditional probability of 7, apart from terms 
independent of the parameter, is L = T log » — N log ¢o(u), and its 
first derivative with respect to p is 


(7/1) oe (N¢1/udo) ‘ (5) 


Equating the derivative to zero gives the maximum likelihood estimator 
of wu. We indicate by an asterisk the maximum likelihood estimator and 
functions of it. Thus 


T — (No%i/¢%) = 0. (6) 


Since the equation is linear in 7’, it is clear that the estimator is to be 
found simply by equating 7 to its expectation: T = Nx* , as may be 
verified from the results of the previous section. 
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The variance of the derivative (5) gives the information J, about pu 
in the sample. Being linear in 7’, the derivative has variance 


7 iP = Nex)? =X iM = key" ) = 

ay be N 2/K : (1 ae a)? (1 a uy? =e [i : 
This result is a special case of the results of Patil [1961], who investi- 
gates the estimation of the parameter of a generalized power-series 
distribution. In large samples, the reciprocal of I , approximates the 
variance of the estimate u*; that is, V(u*) = p?/Nxo . 

For purposes of calculating and tabulating the solutions of the 

maximum likelihood equation, we shall put 


Lk aN 19, 
so that, for all k,0 <v < 1. 


Then the estimating equation may be written 


Sa me et eae aa 
ae ts) Poa” 1)v. (7) 


An alternative form is 


il k 
lL —p l—yp 


ie —(k a pre! = v), 


or 


—k 


L eae ky = Fa Ae 
eae: ete (k (1 v). 


* ig the root 


Thus, if u* is the root corresponding to v, then u* 
corresponding to 1 — v. In particular, if v = 4, u* = 1. 

Since the roots corresponding to values of v exceeding 4 are the 
reciprocals of roots corresponding to values of v less than 3, we may 
henceforth confine attention tov < 4,y <1. 

Equation (7) can be solved iteratively, once an approximate value 
of u has been chosen. If v is not too near 4, and k is not small, a first 
approximation is 

(k — l1)v T 


Eee erat Gaeety = eee > 


and a second approximation is 
k 2 
Be = wy + kw — wy)’. 


The difference between the two sides of (7) when an approximate 
value of u is substituted represents —p/N times (5), the first derivative 
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of the likelihood. The adjustment to pu is given by the ratio of this 
difference to —yJ,/N. Then, approximately, 


ut = ne = tie Te) 


Kg 


The substitution and adjustment may be repeated as often as required 
to give the desired accuracy. 

If v is close to 4, an approximation alternative to (8) is to be pre- 
ferred. We put 


-86 
wee, v= z—w. 


Then equation (7) becomes 


—@ deen 
ae ae f __ = (k — 1)(4 — wv) 
coca 


or, symbolically in terms of Bernoulli’s numbers, 


5 (eM — ef) = (k — 18 — w). 
We then find 
io 
Ba a 1) oo Ge (eee 
2! 4! 
whence 
_  12w 144 (k? + Dw® : 
and 
os ee 72w 144 (kh? + 11)u* 4 
ae eases mire ee 


Solutions of the maximum likelihood equation for various values 
of k and v are given in Table 1. Once values of u are given, it is easy 
to compute 7, . As we shall see in Section X, the solution not only 
gives a point estimate of the ratio u, and an approximate standard 
error based on the information function J, , but also gives a means of 
determining tail probabilities, needed in making significance tests and 
setting confidence limits. 


VI. ESTIMATION OF i 


In general, for the problems considered in this paper, the actual 
value of \, representing the size of the population (or rather, of the 
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TABLE 1 

Maximum LikeLimoop Estimate oF pu [with ARGUMENT v = 7'/(k — 1)N] 

k 2 3 4 5 
0.00 0 0 0 0 
0.05 052632 092894 131334 167119 
0.10 111111 178395 238358 291010 
0.15 176471 261940 333333 392939 
0.20 250000 346500 422530 483323 
DS 333333 434259 509668 567737 
0.30 428571 527202 597388 649654 
0.35 538462 627431 687922 731617 
0.40 666667 737405 783468 815815 
0.45 818182 860221 886484 904443 
0.50 1.000000 1.000000 1.000000 1.000000 


zero-class), is not of interest. However, for testing the significance of 
departure of the frequencies from a geometric progression, estimates of 
the expected frequencies in each class will sometimes be required. 
We shall then be interested in the simultaneous estimation of \ and H, 
based on N and T. We therefore consider this question, rather than 
the estimation of ) alone. 

The joint probability of N and 7, as given in (2), is 


CN, i) —dAdo(w)\N T 
nth a eae 
so that the logarithm of the likelihood, considered as a function of the 
two parameters, and with constant factors ignored, is L = —)d¢o(u) + 
N log \ + T log uw. The derivatives are 
Lh = —do(u) = (N/d), (10) 


L, = [—Adi(u) + T]/u. 


The estimates are given when these derivatives are equated to zero, 
or when WN and 7 are equated to their expected values. For direct 
solution, \ can be eliminated between the two equations, giving an 
equation for uw identical with (6). jw having been found, \ may then be 
found by substitution in (10). 

For simultaneous solution, and also to give a measure of the infor- 
mation about A and yu, we require the covariance matrix of the deriva- 
tives. Noting that N is a Poisson variate, with variance equal to its 
mean Ad¢o(u), that the covariance of N and T is A¢,(u) and that the 


l| 
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variance of 7’ is \¢2(u), we find for the covariance matrix of the deriva- 
tives 


7 ee di/Nu | 
bi/ Ma bo/H 


The inverse of I, giving approximate variances and covariance of 
the estimates in large samples, is 


au 1 | Ne Ans | 
Nob. — $1) een 


From trial values , » of the parameters we find improved estimates 
by means of the equations 


pa i + ri? 
| ig mM IG, 


Mm 


which give explicitly 


(11) 
pt = afi eae hs fel) 
Nob: — $1) 

Note that the trial value of \ does not appear in the equation for 
\*, since the estimating equations are linear in \*. The adjustments 
given by (11) may be repeated to give the accuracy required. 

The expected frequency in class 7 is \u'; the variance of the esti- 
mated frequency may be determined, using I’, as 

gobs — $i 
The relative variance of the estimated frequency is therefore 


bo — 2dr + tbo. 
(dob. — $1) 


VII. TESTS OF DEPARTURE FROM PROPORTIONALITY 


($2 — 2id, + 70). ? 


Before any use is made of data to which proportional frequencies 
have been fitted, it is necessary to test whether the observations depart 
significantly from the estimated frequencies. 

Following Cochran [1954], we shall denote by X” the statistic used 
for testing the discrepancy between observed and expected frequencies, 
and by x’ the random variable with the familiar distribution, to which 
the distribution of X* approximates. We consider here the adequacy 
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of the x°-approximation in testing X? for departure from expectation 
in the present problem. 

Alternatively, we may use the fact that, because N and T are 
sufficient for \ and u, the probability of a sample, conditional on N and 
T, is independent of the parameter values. An exact conditional test 
can thus be made, provided there are several possible samples corre- 
sponding to the given values of N and 7. The probability of a sample 
of values % , 7”; , --* , M-1 With SS mn, = Nand Dy wn, = <T is 


1 
el (n; !) 
Likewise the joint probability of N and T is 
CW, T) 
N! 


where C(N, T) is as defined in (2). Hence, the probability of the sample 
conditional on N and T is 


—Ado(H) NT 
Tare 


—ddo(u)\N 7 
e€ PONS has 


N! 
C(N, T) [I @: 


By enumerating all possible samples and their probabilities we can 
decide the significance of an observed sample. Often the set of samples 
deemed to show significant departure from proportionality will be taken 
as a set of the samples with the smallest probabilities; however, in 
many problems, other criteria may be considered more appropriate. 
The use of X° has the advantage that it orders the samples according to 
a quantitative measure of departure from proportionality, whereas the 
probability attaching to a particular sample may be small merely be- 
cause of the discontinuity of the distribution. A satisfactory arrange- 
ment would therefore be to order the samples by means of X’, but 
to use the exact probabilities. In many cases the exact cumulated 
probabilities will not differ much from those given by the x’-distribution. 
The questions of calculation of probabilities and of choice of the signifi- 
cant set have been thoroughly discussed by Fisher [1950]. 

As an example of the exact calculation, we consider the following 
data in four classes: 


(12) 


Observed Expected 
no 30 27 
nM 5 9 
N2 2 3 
N3 3 1 


596 BIOMETRICS, DECEMBER 1961 


It will be seen that, because the numbers are restricted to four classes, 
the possibilities are much fewer in number than in other examples, 
such as that of the testing of deviations from a Poisson distribution, 
as discussed by Fisher. For this set, N = 40, 7 = 18, and it is readily 
verified that n* = 3, \* = 27. X” is easily calculated as 

2 2 
es nee 
however, since the expected values in two of the classes are small, 
the tabulated distribution of x’ is not likely to be a good approximation 
to the distribution of X’ for this sample. For this reason, as an example 
of method, the exact probabilities have been determined for the 37 
possible samples having values N = 40, T = 18. The probabilities 
and the values of X” are given in Table 2. 

We have C(40, 18) = 220,495,831 X 10°. 

From the Table it is seen that the sample in question just attains 
significance at the 5 percent level; this is so whether the samples are 
ordered according to their probability or by X°. The tabulated distri- 
bution would give a probability, for x° with two degrees of freedom, 


é.* "tee? ax 01039866. 
This shows that, for this example, the tabulated distribution under- 


estimates the significance probability only slightly. On the other hand, 
for the sample 


BL 334.3 
for which the probability accumulated according to X” is 0.008190, 
X” = 8.593, 
the probability of exceeding which is 
0.013619. 


The probability here is overestimated somewhat. 

As a matter of interest, the mean and variance of for the distribution 
generated by this set are H(X”) = 1.9956, V(X”) = 3.4223, compared 
with the values of 2 and 4 respectively for the x’-distribution. The 
low variance will usually lead to overestimation of probabilities at the 
tails of the distribution and underestimation of significance, when 
referred to x’, though this effect will sometimes be masked by local 
irregularities of the distribution. 


VIII. MOMENTS OF THE NON-PARAMETRIC DISTRIBUTION 


We now show how the moments of the exact distribution of X’ 
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TABLE 2. 
THE FREQUENCIES IN Four CLassEs FoR wuicn N = 40, T = 18 


Cumulative 
no ny No n Probability probability PHC ye 
34 ) 0 6 .000000 .000000 1048 
31 0 9 0 .000001 .000001 610 
33 0 3 4 .000003 .000004 522 
33 if 1 5 .000004 .000608 696 
32 0 6 2 .000010 .000018 376 
32 3 0 5 .000019 .000037 646 
31 1 7 i .000089 .000126 352 
32 1 4 3 .000098 000224 334 
32 2 2 4 .000146 .000370 424 
30 2 8 0 .000173 .000543 408 
22 18 0 0 .000514 .001058 376 
31 = 1 4 .000781 .001839 370 
30 6 0 4 .000807 .002646 360 
31 2 5 2 .000938 003584 226 
31 3 3 3 .002083 .005667 232 
30 3 6 1 .003229 .008896 198 
29 4 7 0 .003460 .012356 250 
24 15 0 1 .004561 .016917 198 
28 9 0 3 .005574 .022491 190 
23 16 1 0 006841 029332 226 
26 12 0 2 .009578 .038910 136 
30 5 2 3 .009688 .048598 174 
30 4 4 2 .012110 .060708 120 
29 ff 1 3 .013840 .074547 160 
28 6 6 0) .023412 .097959 136 
29 5 5 1 .029063 . 127022 88 
24 14 2 0 .034206 . 161228 120 
25 13 1 1 .038311 . 199539 88 
27 10 1 2 046824 . 246363 66 
29 6 3 2 048439 . 294802 58 
27 8 5 0 .070236 .865038 66 
28 8 2 2 .075253 .440291 40 
25 12 3 0 .083006 .523297 58 
28 7 4 1 . 100337 .623634 22 
26 10 4 0 . 105354 . 728988 40 
26 11 2 1 . 114932 .843920 22 
27 9 3 1 . 156080 1.000000 0 


may be determined. This distribution is independent of the population 
parameters, and depends only on the values of N and 7. 
We consider first the conditional moments of the n; . From ex- 


pression (12) we see that 
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ae 


where the sum is taken over all sets of the n; such that >> n; = N and 
> in; = T. Now the expected value of n; 1s 


N! 


CW, aw. = To ) 


In the sum, the total of the arguments, with n; replaced by n; — lI, 
is N — 1, and the weighted sum of the arguments is T — 7. It follows 
that Hore NC(N — 1, T — p/CW, T). By a similar method it 
follows that, if >> a; = = and >. ia; = B, then the general factorial 
moment is 


5] (Le |- NY (CW = 4, By 
ThiGe saat ey =a) CW, T) 


Thus in principle the joint factorial moments of the n; can be de- 
termined provided we can regard the C(N, T) as known. Some dis- 
cussion of the asymptotic representation of C(N, 7) is given in Section 
X, but more needs to be known about these coefficients in general. 

For the test of departure from expected frequencies we have 


2 
ate as 


Now d* and p* are functions of N and T only, so are fixed for the con- 
ditional distribution. Hence X” may be simply regarded as a weighted 
sum of squares of the n,. In particular, since 


N(N —DCN=— 2.7 — 204 NCR) 


Wed CW, 7) 
we have 
de N 
OO) SGU B) 
N — DCW — 2,7 — 21 ines 
SS ( \C(N — 2,7 2 iC Neale oem N: 


higher moments may be found similarly. 


IX. TEST FOR TREND IN FREQUENCIES 


Of particular importance is the test for the reality of any apparent 
trend in the frequencies—that is, whether the estimated value of u 
differs significantly from unity. When WN is large, the test may be 
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made by comparing the difference 1 — u* with its standard error, or, 
what is equivalent, testing }(k — 1)N — T against its standard error. 
When u» = 1, the variance of u* reduces to 12/(k? — 1)N, so that 
the test statistic, distributed approximately as x” with 1 degree of 
freedom, is 
(hk? — 1)N(1 — u*)?/12. (13) 
More directly, the variance of T is 
[((k? — 1)N]/12, 
so that an alternative test statistic is 
12(3(k -1)N -—T)/(k — DN. (14) 


The statistics (13) and (14) are equivalent in large samples since, 
when yp* is near to unity, we have 


_ 26@ = DN — 7) 
Coy 


u* = 1 


+ OW™), 


as can be seen from (9). 

The exact significance probability is given by the probability of a 
value of T less than or equal to that observed, which is easily computed 
directly when N is not large. This probability should be doubled, 
since both tails of the distribution are relevant to this test. With 
uw = 1, this probability is simply 


e 
WCW, A/F = SW) T)/k*. 
Now the generating function of S(N, 7) is 
[Sow / — ») = (1 — )"/0 — »)”™. 


From this representation we readily deduce that 


ann 57) -(NET 
Bee ta. DE ee Lis 


which provides the most convenient means of computation of isolated 
values of the sum. On putting 7 = (k — 1)N in (15) we have the 
interesting corollary 


i) Ne +E 9) = 2 
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Values of T less than 3(k — 1)N required for significance at the 5 
and 1 percent levels of probability have been determined for various 
values of k and N, and are presented in Table 3. For values of NV beyond 
the range of the Table, the large-sample x’-test may be used. 


TABLE 3 


Exact Trst ror EXISTENCE or TREND 
(1.8. or Nui Hyporuesis » = 1) 


5 Percent Point or T [or (k — 1)N — T] 


k 
N 3 4 5 
5 1 2 3 
6 1 3 4 
a 2 4 6 
8 3 5 7 
9 3 6 o, 
10 4 W 10 
11 5 8 12 
12 6 9 13 
13 6 11 15 
14 7 12 17 
15 8 13 18 


1 Percent Point or T [or (k — 1)N — T] 


k 
N 3 4 5) 
5 0 0 1 
6 0 1 2 
a 1 2 4 
8 1 5) 5 
9 2 4 6 
10 3 5 8 
itil 3 6 9 
12 4 a 11 
13 5 8 12 
14 5 9 14 
15 6 11 15 
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X. EVALUATION OF TAIL PROBABILITIES 


When N is large, the calculation of the probabilities of the tails 
of the distribution is troublesome. Blackwell and Hodges [1959] have 
given a method for expeditiously finding approximate tail probabilities. 
The method depends on a transformation of the distribution to a 
new distribution for which probabilities in the neighbourhood of the 
mean are to be determined. Daniels [1954] has given equivalent re- 
sults, expressed explicitly for continuous probability densities, and 
Good [1957] has given similar results to the term of order N~? of the 
leading term. 

We briefly outline Blackwell and Hodges’ results applied to the 
present distribution. We make use of the fact that the distribution 
of T is the N-fold convolution of the distribution of X as defined in 
Section IV. 

The moment-generating function of X — a, where a is any constant, 
is M(s) = Efe’*~”}. 

Let s* be the unique value of s that minimizes M(s), and denote 
the minimum of M(s) by m/(a); note that s* is a function of a. Then 
we transform to a new variable Y, for which 


PUK = ae 9 


P(Y =2) = WG) 


(16) 
It is readily seen that Y has in fact a probability density with the same 
range as X. Also, by differentiating the numerator of (16) with respect 
to s* and summing, we find that H(Y) = a. 

Before continuing with Blackwell and Hodges’ method, we establish 
the interesting result that, for any distribution for which the sum of 
the values of X is a sufficient statistic for the parameter yw, the new 
variate Y has a distribution of the same form, but with parameter u*, 
the maximum likelihood estimate corresponding to the value X = a. 
This result is probably well known, though we have never seen it 
discussed. Thus we see that, for distributions of this commonly occurr- 
ing class, the solution of the maximum likelihood equation is important 
not only for providing a point estimate, but also for evaluating the tail 
probabilities corresponding to the particular observed value. 

If the sum of the values of X is sufficient for u, the density of X 
may be written as 


h(xje" fu), (17) 
where 


flu) = Do hayes. 
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Then 
M(s) oe S, Aye as 


so that, for any value of s whatever, a transformed density is given by 
Wl ke ee He MNS) 


This density is clearly of the same form as that of X. 
Now the maximum likelihood equation, corresponding to an observed 
value a, turns out to be 


a— > shave" / > A@e*” =0, 


where we have differentiated (17) with respect to g(u) rather than u 
itself. This is in fact H(X | » = u*) = a. Now from (16) we derived 
that E(Y) = a. It follows therefore that the parameter of the distribu- 
tion of Y is y*. 

We can also express m(a) in terms of the maximum likeli- 
hood estimator. On putting x = a in (16), we see that m(a) = 
P(X = a)/P(Y = a), or Pa | »)/P(a| w*). 

For the particular distribution we have been considering, therefore 


wo = (28) 


where we indicate by an asterisk a function of y*. 


Blackwell and Hodges, by means of the transformation (16), and 
taking Na = T,, show that 


P(X, + Xp b+ + Xy = T) = [m@) PO Yabo ae ed 


and thence, that each side is equal to 


pier | ace 5 at) -)) 
A/ig We BN Na? ae) eae 


or 


He *\N : 
led 0 1 il i. — Do Ke ) 25 | : 
(Ss) va Von x E + an let > Bet) > OCD | eds) 


where the «* are the cumulants of the distribution of Y; that is, func- 
tions of u*. The explicit terms in (18) give a satisfactory approximation 
to P(T | N), provided N is large. 

We note that the series in (18) depends only on y* (i.e., on 7’), and 


not on w. Thus what we have obtained is an asymptotic expansion of 
C(N, T). In fact, 
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C(N, T) = eee E 1 i. D i) - 
ak / 2a NE ee | 


Thus, C(N, T) may be estimated by substituting the actual values of 
the cumulants, from the formulae given in Section IV. 

As a check on the asymptotic formula, we estimate C(N, T) for 
the values k = 4,.N = 15,7 = 11. We deduce p* = 3, do) = 15/8, 


Ke = _ 194/15° = 0.862222, 
k¥ = 2842/15° = 0.842074, 
kit = 1434/15° = 0.028326. 
Hence 
. 15 ae Z 1 805604) ¥ 
Glib. 1) an-194 @)" 1 0 = 2,784,963. 


The true value is 
25). (2 (2) aes) ae 
fe) ’) ie) orb) a ie Ate 
from which the estimate by the asymptotic formula is in error by 180 


per million. 

For significance testing, the cumulative tail probabilities are more 
important than the individual terms. These cumulative probabilities 
are also given by Blackwell and Hodges. They give 


[m(a)]* | Fie = ns Be) 
Ee Wa ey BN Ne? Bg 


Z es —z) + «40 + 2) n ow" | 


~ ON KF°(1 — 2)” 
Cire) let eed = 
re) E ON «FP — 2 TON!) 


where, for the distribution considered here, z = u/u*. We note that, 
as is to be expected, the bracketed series in the cumulative probability 
depends on ». The approximation is valid only for u* > yu, and is 
most effective when z is small—that is, when u* is large and the extreme 
tails of the distribution are being considered. 

When we are determining the probability in the lower tail of the 
distribution, that is P(7’ < #), so that T < 1(k — 1)N, and p* < gp, 
we simply replace z by z' throughout the formula, and change the 


sign of «%. 
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As a test of the adequacy of this approximation, we apply it to the 
example for which the individual term was calculated above, namely 
k= 4,N = 15, T = 11. 

Inserting the values found above, we have z = 2y, 


P(T = 11) 
1 
(: i 2) 
—2842f, 1 aa 2) 
Jeger fo: ( x) + ihe gage 
60. 194? (1 - 2) 
15° Qu. 


We consider the adequacy of the approximation when pu = 1, for which 
we can readily determine the exact probability. We have 


P(T-= 11) = 2;784600/4" 


P(T < 11) = 


a aa 


so the leading term in the approximate probability is 
2P(T = 11) = 5,569,200/4"" = 0.005187. 


For the first adjustment we find the factor 1 — 0.078223 giving 
5,133,560/4"" = 0.004781. The exact value of the denominator is 


sas, = (2) ~ (12) (298) sas, 


giving the exact probability 0.004893. 

We see that the first adjustment gives an approximation adequate 
for the purpose of assessing the significance probability. Since both 
tails of the distribution are relevant to any test of significance, the 
significance probability is 0.009786; 7 = 11 is thus the 1 percent point 
of the distribution. 

By determining the tail probabilities corresponding to any given 
value of » we can in principle set a confidence range for » — the set of 
all » which are not discordant with the observed values N and YT. 


XI. APPLICATION TO EXPERIMENTAL DATA 


The data in Table 4 show numbers in each of the first three instars 
of immature cowpea aphids (A. craccivora), from samples drawn on six 
different occasions from the same population. Since the experimental 
conditions were uniform, and the average duration of each instar is the 
same (about 42 hours at 20°C), the expected numbers are in geometric 
progression. The Table also gives an overall estimate of the common 
ratio, 0.529820. In order to determine the growth-rate of the population, 
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TABLE 4 
Acs-DISTRIBUTION OF IMMATURE CowPra APHIDS (A. craccivora) 


Instar 


Sample I II Ill N fi * je Xe 


181 92 51 324 194 179.730 0.526015 0.11 
148 78 42 268 162 147.737 0.531518 0.01 
130 54 43 227 140 123.456 0.543412 4.07 
37 21 128 79 69.580 0.543848 0.03 
88 41 20 149 81 87.710 0.474049 0.13 
85 42 28 155 98 82.857 0.558629 0.63 


Ooo» r Whe 
J 
i=) 


Total 702 | 344 | 205 1251 754 690.958 0.529820 2.14 


ce 


we equate this ratio to e °’, c being the average duration of instar and p 
the growth-rate. 
We then find 
cp = —log (0.529820) = 0.6352 
so that 
p = 0.6352/42 per hour = 0.01512 per hour = 0.363 per day. 


0.363 


The population grows at the rate of 100(e — 1) = 44 percent per day. 
Table 4 also gives estimates of \ and uw for each sample, and X” 
with 1 degree of freedom measuring departure from the common ratio. 
Only the largest X°-value, for sample 3, attains the 5 percent level of 
significance, so that there is no evidence for departure from hypothesis. 
The variance of the estimate of » from a sample of N is p’/V(T) 
which reduces when k = 3 to 
(GS ye Te 
N( + 4u + 2’) 
Using the overall estimate u* = 0.529820 we find the variance of esti- 
mate to be 
Vig) = 0:510813/N- 
The consistency of the estimates of » from the different samples 
may be tested by X’. 
For consistency, 


X? = (0.526015? X 324 + 0.531518” X 268 
+ --- — 0.529820° & 1251)/0.510813 
= 1.84, with 5 degrees of freedom. 
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The individual ratios differ no more than would be expected by chance. 

After the manner of Section IX, formulae (13) and (14), an alter- 
native statistic for testing consistency is based on comparison of the 
ratios T'/N. 


V(T/N) = wl + 4u + w)/NG + w+ 2)? = 0.549534/N 
Hence 
X? = (1947/324 + 162°/268 + --- — 754°/1251)/0.549534 
= 1.33. 
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COMPUTING PROCEDURES FOR ESTIMATING 
COMPONENTS OF VARIANCE IN THE 
TWO-WAY CLASSIFICATION, MIXED MODEL? 


S. R. SEARLE, 
New Zealand Dairy Board, Wellington, New Zealand, 
AND 
C. R. HenpERson 
Cornell University, Ithaca, N. Y., U. 8. A. 


INTRODUCTION 


Methods for estimating variance components from unbalanced data 
are given in Henderson [1953] for both the random model and the 
mixed model. The calculations involved in the latter case are some- 
what tedious and usually not computationally feasible for data having 
many classes. This paper outlines the analysis for unbalanced data 
from a two-way classification with one classification considered fixed. 
Simplified computing procedures are presented suitable for a large 
number of levels in the random classification and a reasonable number 
of levels of the fixed classification. 


MODELS AND ESTIMATION 


The model for the two-way classification can be taken as 
Die Pb Ady Bp eB yeh Cin 5 (1) 


where 2;;;, is the observation and yp is the general mean. a; is the effect 
due to the z’th level of the a-class, 6; the effect due to the j’th level of 
the 6-class, a8;; the interaction and e,;, a random error term. We will 
suppose that the number of a-classes in the data is a, the number of 
B-classes is b, that there are n;; observations in the 2j’th subclass, and 
that s sub-classes have observations in them. All terms except u are 
taken as independent random variables in the random model with zero 
means and variances o7 , 73 , 743 and a; respectively. The #-classifica- 
tion will be considered fixed in the mixed model, the variances involved 
being o2 , oz, , and o; , the interaction terms being random for the 
particular set of fixed effects occurring in the data. In this case we are 
assuming an underlying multivariate distribution with v;;, having mean 


1Parts of this paper are based on the thesis submitted by the first author in partial fulfillment of 
the requirements of the Ph. D. degree. 
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value » + 6, and variance oj + oa + o : the covariance between 
v3, and 2;;,- fork # k’ is o2 + o%, and that between 7; ;: and 2i3"%: 
forj #jandk #k’, iso. 

Variance components can be estimated in both models by equating 
linear functions of sums of squares to their expected values, the sums of 
squares used being reductions in the total sum of squares due to fitting 
various elements of the model as if it were a fixed model. For example, 
fitting (u + «;) results in matrix equations of the form 


Pa = yj, 
where y is the vector of a-class totals z;.. , and @ is the vector of esti- 


mates of the a’s. P is non-singular, a diagonal matrix of order a, of 
terms n;.. Thus 


a= Py, 
and the reduction in the total sum of squares due to fitting the a’s is 
R(u,a) = ey = y'/P'y. 


This is the usual uncorrected sum of squares, namely 


Ry, a) = 2 ce a (2) 
Similarly 
Riu, 8) = X Naas (3) 
RG a, 8368) — yy pe tii. /Nis 5 (4) 
Ry) 2, (5) 
and 


R(O) = dX > es Ui in : (6) 


The variance components of the random model are estimated by 
equating the expected values to the observed values of differences among 
the above uncorrected sums of squares, namely (2)—(5), (8)-(5), (4)- 
(2)-(3) + (5) and (6)-(4). These cannot be used in the mixed model 
because, apart from (6)—(4), their expectations then contain functions 
of the fixed effects. The within-subclasses sum of squares, (6)—(4), 
can still be used to estimate the error variance as in the random model, 
and «2 and o%,, can be estimated from the differences: 


Ru, Qa, B) —- R(u, 6), 


and 


R(u, as 85 a3) a Ru, Qa, 8), 
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whose expectations are free of terms in the fixed effects. (wu, a, 8) is the 
reduction in the total sum of squares due to fitting a and 6 alone without 
the interaction terms. This reduction in sum of squares does not occur 
in the random model analysis and, when required for the mixed model 
analysis it usually involves a considerable amount of computing. Its 
relationship to the random model analysis will now be shown and a 
simplified computing procedure derived. 


SIMPLIFIED EXPRESSION FOR R (u, a, 6) 


Henderson et al. [1959] have shown that fixed effects in a mixed 
model can be estimated by treating the random effects as fixed and 
maximizing the joint distribution function of the observations and the 
random effects. The resulting equations for the two-way classification 


A 


where P is a diagonal matrix, of order a, with terms n,. , R is a diagonal 
matrix of order b — 1 with terms 7.; ,j ¥ b, Q is a matrix of order a 
by b — 1 with terms n,; , 7 ¥ b, y is a vector of the z,.. totals and z 
is a vector of the z.;, totals. R, Q and z’ have b — 1 columns because of 
omitting the equation for the last 6, appropriate to imposing the con- 
straint 8, = 0 in order to have a unique solution. R(u, a, 8) from these 
equations is 


R(u, a, B) = wo”) 
Z 


-yxf? 9 
Oa Zz 


This expression for obtaining R(u, a, 8) requires inverting a matrix of 
order (a + b — 1), which is not feasible in many situations because a, 
the number of random classes in the data, is large. For example, the 
analysis of dairy production records in Searle and Henderson [1960] 
involved 688 herds (random) and 4 age groupings (fixed) so that a 
matrix of order 691 would need to be inverted for equation (8). How- 
ever, because of its special form, it can be reduced to inverting a 3 X 3 
matrix. Thus in the general case for the two-way classification the 
‘inversion of an (a + b — 1) matrix can be reduced to inverting one of 
order (b — 1) and this is a considerable reduction in the computing 
required especially when a, the number of random effects, is large, 


as is frequently the case. 
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The simplification of (8) proceeds as follows. Let 


fe os * *| (9) 
Q’ B’ D 
Then 

A= PP ODO Pa. 

B = —P''QD, (10) 


D=(R— QP'Q)", 


R(u, a, B) = a]? aration) pe pala 
—DQ’P™ pd Iz 


= yP’y + (z’ —y'P Q)D@ — PO). (11) 

The matrix P is easily inverted, being diagonal, and the only non- 

diagonal matrix to invert is D = (R — Q’P’Q)™ of order b — 1. 

The first term in the above expression is R(u, a), and the second 
can be derived from equations (7). Eliminating 


a= Py — Q8), 
gives 

6 = D(z — Q’P’’y). (12) 
This is easily computed and can then be used to obtain Rg, , the re- 
duction in the sum of squares due to fitting the 6’s in this no-interaction + 
model, as 


Rp = 6'(z — Q’P''y). (13) 
Thus from (11) R(u, a, 8) can be expressed as 
Ru, Qa, B) im Ru, a) sts Re ) (14) 


the first term of which is part of the random model analysis and the 
second term comes from (12) and (18). 


EXPECTED VALUES 
The expectation of R(u, a, 8) can be found by using (8) and (9) 


and writing 
Ru, «, 8) = (y’2")| 4 au 
DILz 


= (y’Ay + z’By + y’Bz + z’Dz), 
= tr (Ayy’ + 2Bzy’ + Dzz’), 
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where tr(A) is the trace of the matrix A, the sum of its diagonal terms. 
After substituting from (1) for the vectors of totals y and z and also 
using (1) in (8), (4) and (6) it can be shown, with a certain amount of 
algebraic manipulation, that the differences used for estimating the 
variance components have expected values 


E[R(u, a, 8) — R(u, 8)] 

= (n., — ha)oz + (hg — fa)oes- + (a — Io. , 
E[R(u, a, B, a8) — R(u, a, 6)] 

= (n.. — kpows + (s —-a—b+ 1a, 
E{RO) — Rtu, a, B, o8)] 

= (n.. — s)o. . 


The variances are estimated by equating the right-hand sides of 
these equations to the calculated values of the differences in the square 
brackets of the left-hand side. The term k; is one of the coefficients used 
in the random model analysis, namely kz = >.>, [>.°-1 2,]/n.;.. The 
coefficient kz comes from the expected value of R(u, a, 8) and can be 
expressed in the form 


k= 2 tr (AU => 2BV -- DW), (15) 
where U is a diagonal matrix of order a with terms )J;-; 77; for 
4 = 1--- a, V is a matrix of order a by b — 1 with terms ni, 
fori = 1---a,andj =1--- 6 — 1, and W isa diagonal matrix of 


order b — 1 with terms >.7_, n?; forj = 1---b—1. 
COMPUTING Rg AND kg 


R; is obtained from computing the following terms: 
(i) A square matrix C, of order (b — 1) with terms 


tly (ee De nis /Ni. : j = Ky Bien be — i 
and 
Cii, = — o ninige/M. JA = lige ee eo — 1s 


Computing c,; provides the check that De aie Cpe het Oy Ornelas ae. 
row and column totals of the augmented C are zero. 
~ (ii) A column vector r, of order b — 1, whose terms are 


r= 2.5. — Dy Maki.. ; 4 eo logeLD ole 


Computing r, provides the check that the sum of all the 7;’s is zero. 
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(iii) A column vector, of order 6 — 1, of the estimates of the §’s, 
6 = Cur 


(iv) The “inner product” of the terms in the preceding two vectors is Rg: 
b-1 
R; = 64 = >. G1; = 1'er. 
j=1 


This is equation (13), the C~’ used here being identical to D. 

Expression (15) for kg is obtained using the elements of C™*, which 
we shall call d,;, , obtained in the calculating of Rg above. The special 
forms of the matrices involved in (15) enables kg to be written as 


a b-1 a b-1 b-1 a 
kp =e ys Neer pe AOD ia 2.2 PE Oe a(S po 


where 
b-1 
Xe Os ) 
jaa 
fiat = (nz;/n:.) Oita ORR 2nii); 
and 


fisige = (Nii; /Ni.) Ai — Nig — Nai’)- 
The f’s, of which 3b(6 — 1) in number are required, can be calculated 
for each 7 and summed, and the expression is well-suited to evaluation 
on a computer. Desk calculation can be arranged from the same 
formula when b is small, and the number of a-classes (random) is not 
too large. Calculating all of the 4b(b + 1) f’s enables the following 
check to be placed on them: 


See ien a= = QL nisnss/n., « (16) 


i’=1 a 


The f-values and LS. d, can be obtained simultaneously with the terms 
of C andr; Rg is calculated as r’C~’r and kg is obtained using the elements 
of C™* as above. 


EXAMPLE 
The calculation of Rs and kg is demonstrated for the small hypotheti- 
cal example shown in Table 1. 


The steps for obtaining R, are as follows: 
(i) The c-values are 


53 — 17/10 — 37/20 — 127/40 — 127/50 — 25°/50 
45 — 2°/10 — 6°/20 — 12°/50 — 257/50 


C14 


a3 47 
27.42, 


I 
I 


Co2 
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TABLE 1 
HYPOTHETICAL EXAMPLE 
Number of observations n;; Mean 
observed 
ql Totals values, 
1 1 2; 3 4 Nie Zi, 
1 1 2 3) 4 10 200 
2 3 6 a 20 300 
3 1 — 12 16 40 400 
4 2 12 13 13 50 500 
5 2 25 — — 50 600 
Totals 
N.5 53 45 32 40 n.. = 170 
Totals of observed 
values, x ;. 23000 23000 14000 19000 
C33 = 32 — 3°/10 — 4°/20 — 127/40 — 137/50 == 23.324 
C4z = 40 — 4°/10 — 7’/20 — 167/40 — 137/50 = 26178 
C2 = —1(2)/10 — 3(6)/20 — 12(12)/50 — 25(25)/50 = —16.48, 
G3 = —1(3)/10 — 3(4)/20 — 12(12)/40 — 12(13)/50 => 7-02, 
and similarly 
Ca —=9.37, (Gy = 6.02, 
C23 = —4.92, Coa —10.78. 
The check on these values, namely >>”,_, c;;, = 0, can be seen to hold 


true; for example 
Ci 4. Ci2 a C3 - Ca DOLL = 16.48 => HOW ae 9.37 —— O. 
(ii) The r-values are obtained using the totals of the observations 


for the levels of the fixed effects and the means for the levels of the 
random effects: 


r, = 23000 — 1(200) — 3(300) — 12(400) — 12(500) — 25(600) = —3900, 


7, = 23000 — 2(200) — 6(300) — 12(500) — 25(600) = —200, 
rs= 14000 — 3(200) — 4(300) — 12(400) — 13(500) = 900, 
r,=19000 — 4(200) — 7(300) — 16(400) — 13(500) = 3200. 


A check on these is that their sum is zero. 
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(iii) 
33947 —1648 —7.62)_ 
C= D =| =1648. 27.42) —2.92 
27.6 Store 22 22 
(iv) 
Rs 


I 


r’Dr 
053824 .036902 025373 || —3900 | 
(—3900, — 200, 900)! .036902 .063205 .025393 | —200 
025373 .025893 .056530/| 9001 


= 736703. 


Calculating kg involves the elements of D and the f-values. The 
latter, together with the \,-terms are shown in Table 2. The calcula- 
tions for 7 = 1 are as follows. 


h, = (2 + 2? + 3”)/10 = 1.40, 
fia = 1770.4 + 10 — 2)/10 = .94, 
fogs = 2-4 4-10 — 4/10 = 22-96; 
fos = 3°(1.4 + 10 — 6)/10 = 4.86, 
fia = 1(22)(.4 — 1 — 2)/10 = —.32, 
fias = 13)(1.4 — 1 — 3)/10 = —.78. 


Similarly f,,14 = —1.44, f1.22 = —2.16, fi.2s = —3.68 and jf, 44 = 0.12: 
The sums of these terms, over all values of 7, are shown in Table 2 
from which the f-values can be checked by the expression (16); for 
example 


DS Gen tees 


ry 


I 


I 


505.8357 — 360.9718 — 113.1182 — 158.0607 

— 126.31 

= —4°(1)/10 + 7°(3)/20 + 16°(12)/40 + 137(12)/50 
—(1.60 + 7.35 + 76.80 + 40.56). 

From the c’s and the f’s kg is now computed as 

kg = 45.79 + [505.8357(.053824) + 436.5532(.063205) 

+ 212.4332(.056530) 


I 
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TABLE 2 
Terms Usep In CaLcuLATING kg 

1 ri fin Siro fi,ss fia 

1 1.40 .9400 2.9600 4.8600 5.4400 

2 3.05 7.6725 19.8900 12.0400 22.1725 

3 7.20 83.5200 83.5200 97.2800 

4 9.14 HOME 2032 EO le 203 2m 2a Oo me oROns? 

S 25.00 312.5000 312.5000 — = 
Total | 45.79 505.8357 436.5532 212.4332 236.9057 

a —figa9 —fijss jf —fi,os —fios 58! 

1 .3200 .7800 1.4400 2.1600 3.6800 6.7200 

2, 5.3550 2.3700 7.2975 8.3400 20.8950 11.1300 

3 — 60.4800 99.8400 —— aoe 99.8400 

4 2.7968 49 4832 49 4832 49.4832 49.4832 56.9868 

5 312.5000 — — = see cae 
Total | 360.9718 113.1132 158.0607 59.9832 74.0582 174.6768 


— 2[3860.9718(.036902) + 113.1132(.025373) 


= 77.16. 


+ 59.9832(.025393) ] 


This procedure for obtaining R, and kg may not appear greatly more 
straight-forward than inverting the matrix required in (8) which in 
this case is the 8 X 8 matrix 


[10 


1 
2 
eS 


20 d 6 4 
40 2” 012 


50 1212. 13 


a 121225 53 
Gy O92 25 45 


Ai 32 = - (0 32] 


Advantages are apparent however, when one considers the case of a 
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large number of levels of the random classification 500 say, instead of 5. 
The matrix to be inverted would then be of order 503 while the pro- 
cedure outlined here would still only require inverting a 3 X 3. The 
calculation of its terms the c’s and of the terms for kg , the f’s, is still 
lengthy but can be accomplished separately for each 7 and summation 
made over 7. This can be arranged quite straightforwardly for a desk 
calculator and is easily organized for an electronic computer such as 
the IBM 650, for example. 
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OPTIMUM SAMPLE SIZE IN ANIMAL 
DISEASE CONTROL’ 


A. W. Norpsxoe?, H. T. Davin? anp H. B. Ersensperc! 
Iowa State University, Ames, Iowa, U. 8S. A. 


1. INTRODUCTION 


The objective of an animal disease control program would either 
be (a) to completely eradicate a disease or alternatively (b) to reduce 
the incidence of a disease to a low level and then to keep it in check. 
In the case of (6) the causative organism might not be completely 
stamped out. 

If a disease has not obtained a strong foothold in a country, then 
objective (a) clearly is justified and perhaps the most drastic methods 
of control are in order. If, on the otherhand, a disease is widely dis- 
tributed both geographically and zoologically, then alternative (b) 
might be more realistic. This implies that the objective of a particular 
control program should take into account the cost of the program 
relative to the price of the consequence if there were no control. 

A case in point is the method of control for Pullorum disease 
(Salmonella pullorum) in poultry. The active form of the disease 
causes an enteritis in baby chicks which may result in high death 
losses. Chicks which recover may be carriers of the disease as adults. 
Female carriers may then transmit the organism via the egg to their 
progeny, where the disease again may become epidemic. Other species 
of domestic birds, and numerous wild species including pheasants, 
quail, and even foxes, cats, swine, cattle and man, have been reported 
as potential carriers of Pullorum. 

Control of this disease is based on a blood testing technique. 
Samples of blood, collected from each member of an adult flock, are 
tested against an antigen of pullorum organisms for agglutinizing 
antibodies. Positive reactors to the blood test are judged to be disease 
carriers and therefore are removed from the flock and slaughtered. 

The blood agglutination test, carried out in most states for the past 
three or four decades, has now reduced the disease to one of relatively 


1Journal Paper No. J-4047 of the Iowa Agricultural and Home Economics Experiment Station, 


Ames, Iowa. Project No. 1326. 
2Department of Poultry Husbandry. *Department of Statistics. ‘Department of Statistics; now 


at System Development Corporation. 
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minor importance. For example, the number of reactors reported by 
participants in the Lowa Control program was less than .07 of one 
percent in 1959 while in the Massachusetts program no positive reactors 
were reported in that year. 

When a disease such as Pullorum is widely distributed but at a low 
incidence on a state-wide or national level, it becomes problematical 
whether to continue past practices of completely blood testing all birds 
in a flock, or whether blood testing only a sample, or even no birds at 
all, of each flock might be more economical. 

The present paper investigates the extent to which 100 percent blood 
testing is justified, assuming random, that is binomial, infestation of 
adult flocks. Specifically, we attempt to determine the most economical 
flock proportion to be blood tested in any given year and locale, as a 
function of certain values and costs and of the prevailing binomial 
infestation rate. 

Economic optimizations in the presence of binomial a priori distri- 
butions have been studied previously, as for example in [1], [2], [3] 
and [4]. However, this prior work is almost entirely concerned with 
industrial inspection problems. The present paper is intended in part 
to illustrate the fact that such methodology, initially intended for 
industrial application, is equally applicable in the biological realm. 

We recognize that the assumption of binomial infestation might in 
certain instances be improved upon, by suitable contagion models for 
example. However, this assumption seems not unreasonable in the 
case of Pullorum; in this case, the adult flock is composed of birds 
which, though able to transmit the disease to their progeny through 
the egg, have low contagious influence as carriers. 


2. ALTERNATIVE FLOCK TESTING POLICIES; 
VALUES AND COSTS 


We consider the following one-parameter (the parameter is 7) 
family of flock testing policies: 


Sample and blood test » birds out of a flock of N. If the 
sample contains no reactors (reaction assumed equiva- 
lent to infection), return the n birds to the flock and 
blood test no more. If the sample contains one or more 
reactors, blood test the entire flock; slaughter all dis- 
covered reactors. 


The relative economic worth of each of these N + 1 competing 
flock testing policies are now evaluated in the light of the following 
values and costs. 
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as blood test cost per bird, 

Pe carcass value per bird, 

a(k): average value of a member of a flock of size N that 
contains k undiscovered reactors, 

b(k): value of a non-reactor from a completely tested 
flock of size N containing k discovered reactors. 


Note that a(0) = b(0) is the value of a member of a flock con- 
taining no reactors. Again, the value of a discovered reactor from a 
completely tested flock is c. Finally, the precise shapes of the function 
a(k) will depend on the likelihood that the progeny from a mating 
involving an undiscovered reactor will become acutely infected, thereby 
precipitating an epidemic in the progeny flock. The shape of the 
function b(k) will relate largely to loss of good-will. If good-will is 
not an important factor, it will not be unreasonable to set b(k) = 6(0) 
for all k; indeed, this is the assumption made at the end of Section 5. 


3. PRELIMINARY CONSIDERATIONS 
Computationally tractable forms of a(k) and b(k) are as follows. 
a(0) = A,a(k) =a for k #0, 
b(0) = A,'b(h) = 6 tor k& =-0. 


(1) 


It is shown below that the most profitable n is either 0 or N. If this 
can, for the moment, be assumed, then the derivation of the most 
profitable sample size (1.e. the choice between 0 and N) becomes a 
matter of simple arithmetic, at least for the limiting cases of very small 
or very large flock size NV. 

Consider a large number MW of birds, grouped into m flocks of 
N = M/m birds. The case of very small flock size is typified by N = 1, 
while the case of very large flock size is typified by m = 1. 

For N = 1, the average value of the M birds (i.e. one-bird flocks) 
equals 


M(1 — A+ Mra, if n= 0, (2a) 
M1 — A+ Mae — Mi, if n = N (=1), (2b) 
so that 100 percent testing will be more (less) profitable than no testing 
according to whether 
} rc — a) > (<) @. (3) 


For m = 1, the average value of the M birds (i.e. one flock of M 
birds) equals 
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M(1 — ma + Mra = Ma, if n=0, (4a) 
M(1 — 7)6 + Mre — Mi, if n= N(=M), (4b) 


so that 100 percent testing will be more (less) profitable than no testing 
according to whether 


(eet We GOLA ar (5) 


Expressions (2a) and (2b) arise from the fact that, under both zero 
and complete testing, a proportion (1 — 7) of the M one-bird flocks 
will be disease-free, contributing an amount M(1 — aw) A to average 
value. In addition, there will be a value contribution from the Mr 
one-bird flocks containing a reactor; this contribution will amount to 
Mra in the case of no testing and to M7c in the case of 100 percent 
testing. Finally, there is the testing cost Mz that is incurred under 
100 percent testing. 

Expressions (4a) and (4b) arise from the fact that a very large 
flock will contain an approximate proportion 7 of reactors; hence, 
under 100 percent testing, there will be approximately Ma birds con- 
tributing carcass value c to the flock, and approximately M(1 — 7) 
birds contributing value 6. Similarly, if WM is large enough to make 
Mr large, the flock of M birds will contain at least one reactor with 
probability essentially 1, leading to an average per-bird value of @ in 
the absence of testing. 

The criteria represented by (3) and (5) constitute an almost adequate 
solution for the case of the value assumptions given in (1). The further 
computations of Section 4 will serve only to validate the assertion that 
the most profitable sample size must either be zero or N, and will lead, 
as well, to the analogues of (3) and (5) for flocks of intermediate size. 


4. THE COMPUTATION OF THE MOST PROFITABLE SAMPLE SIZE 
FOR THE CASE OF CONSTANT VALUE DIFFERENCE 0(k) — a(k) 


This is the case typified by form (1) of the functions a(k) and b(k). 
The computations proceed as follows. Let 


¢, = the binomial probability of k reactors in a flock of 
Na & eee (6) 
and let 


h,., = the hypergeometric probability of obtaining 7 non- 
reactors when drawing a random sample of size n 
from a flock of N birds containing k reactors and 
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(N — k) non-reactors = @ be SAC) (7) 


nr n 


Then for any particular one of the (V + 1) alternative policies pre- 
sented in Section 2, say the policy corresponding to sample size n, 


Pr {number of reactors in the flock = k; number of re- 
actors in the sample = 0} = ¢,ha.x , (8) 


and the “profit’’ ensuing if the number of reactors in the flock equals 
k and the number of reactors in the sample equals 0 is 


Na(k) — ni. (9) 


Hence the contribution to expected profit of the events involving no 
reactors in the sample equals the sum of the products of (8) and (9): 


N-n 


D [Nalk) — ni}-be-Fn - (10) 


k=0 
Again, for the same sample size n, 


Pr {numbers of reactors in the flock = k; number of re- 
actors in the sample > 0} = ¢,-(1 — h,.,), (11) 


and the “profit”? ensuing if the number of reactors in the flock equals 
k and the number of reactors in the sample exceeds 0 is 


(N — k)-b(k) + ke — Ni. (12) 
Hence the contribution to expected profit of the events involving one 


or more reactors in the sample equals the sum of the products of (11) 
and (12): 


> ((N — k)-b(k) + ke — Ni)-b-(1 — fn). (13) 


Total expected profit will equal the sum of (10) and (13), which, 
neglecting terms not involving n and using the fact that h,,, = 0 for 
k > N — n +1, can be written 


SNe rg) ae ty = alec, Same) 


Expression (14) is further reducible to 
(1 — x)": {N-Ela(k) — b(k)] + E[k-(0() — &] + (W — n)-1}, (15) 


where the expectation E[ ] is with respect to the chance variable k 
having a binomial distribution with parameters (V — n) and x. This 
type of expectation arises from the fact that 
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bln t = d — a (% a "r( vas a 


Further reduction leads, except for the additive term (1 —7r)*-N(B—a), 
to the expression 


(1 — 7)(T — nS) = PC), (16) 

where 
T= Na—Stas ao 7) (17) 
S = 7(6i— co) = O- (18) 


Differencing P(n) now yields 
A(n) = Pn + 1) — P(r) = (1 — w)"[(mr — YS — a(T — S), 


which shows that A(n) is negative for n < (7/S) + (1/r) — 1, and is 
positive for n > (T/S) + (1/r) — 1. This implies that no P(n) for 
0 <n < N can be larger than the larger of P(0) and P(N), which in 
turn implies that the most ‘profitable’ sample size n either is 0 or NV. 
Establishing this fact (which is reminiscent of conclusions reached in 
[1] and [2]) was one of the two objectives set for this section in the last 
paragraph of Section 3. 

The second objective set for this section was to derive a condition 
analogous to (3) and (5) for determining the relative profitabilities of 
the two sample sizes 0 and N for intermediate flock size N. But this 
now is simply a matter of comparing P(0) and P(N), where P(n) is 
given by (16). This yields the conclusion that 100 percent testing is 
more (less) profitable than no testing at all according to whether 


(8 — a1 — (1 — )*] > (<) (8-0 +7. (19) 


We note that (19) does indeed specialize to (8) and (5) for N equal, 
respectively, to 1 and to ». 

Although details are outside the scope of this paper, it may be of 
interest to point out that condition (8) arises naturally in the computa- 
tion of the sequential Bayes test policy for the value assumptions (1). 
Consider the much (though no doubt impractically) enlarged set of 
flock testing policies consisting of all sequential plans with terminal 
acts A and R: 


A : Stop testing; collect ¢ for every reactor culled out so far; return 
non-reactors to the flock; eventually collect a or 6 per bird of 
the unculled portion of the flock, depending on whether or 
not this portion contains at least one reactor, unless the entire 
flock is reactor-free, in which case collect A. 
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R : Test 100 percent; collect ¢ for reactors, and 6 for non-reactors, 
unless the entire flock is reactor-free, in which case collect A. 

A straightforward application of the methodology given in [2] then 
shows that 100 percent testing is the most economic of the policies in 
this enlarged set if 7(¢ — a) > 7. However, the complementary pre- 
scription for no testing if r(¢e — a) < 7 does not apply in this case. 


5. THE COMPUTATION OF THE MOST PROFITABLE SAMPLE SIZE 
FOR THE CASE OF LINEARLY INCREASING VALUE 
DIFFERENCE 0b(k) — a(k). 


A value assumption alternative to (1) is 
ah) = A-2(A-y), OSKEN, 
‘ (20) 
b(kk) = A — 2 (4 — 6), OS Se, 
Using (15) [which was derived without reference to any specific form 


ot a(k) and b(k)], expected profit now becomes, except for additive 
constants not involving n, 


n 


P(n) = (1 - i Jar + nS), (21) 


where 
T=(A-d)a—r)\(N-)D+N[ry —o +7), (22) 
S = (A — 5)’ > 0. (23) 


The function P(n) given by (21) is best described in terms of the follow- 
ing five parametric cases. 

Case I: S > 0, T = 0. As n increases from — ©, P(n) rise steadily 
from — ~, crosses the n-axis at n = —T/S, equals T at n = 0, turns 
downward somewhere between n = —T/S and n = N, crosses the 
n-axis once more at n = N, turns upward somewhere beyond n = N, 
and approaches the -axis asymptotically from below as approaches 
+ ©. 

Case II: S > 0,0 > T > —NS. As n increases from —o, P(n) 
rises steadily from —, equals 7’ at n = O, crosses the n-axis 
at m = —T/S, turns downward somewhere between n = —T/S and 
n = N, crosses the n-axis once more at n = N, turns upward somewhere 
beyond n = N, and approaches the n-axis asymptotically from below 
as n approaches + ©. | 
Case III: S > 0, T < —NS. As n increases from — ©, P(n) rises 
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steadily from —@, equals 7’ at n = 0, crosses the n-axis at n = N, 
turns downward somewhere between n = N and n = —T'/S, crosses 
the n-axis once more at n = —T7’/S, turns upward somewhere beyond 
n = —T/S, and approaches the n-axis asymptotically from below as n 
approaches + ©. 

Case IV S = 0; FS 0. ‘As mn incretses rome ac, P(n) decreases 
steadily from +, equals 7 at n = 0, crosses the n-axis at n = N, 
turns upward somewhere beyond n = N, and approaches the n-axis 
asymptotically from below as n approaches + ©. 

Case V: S = 0, T < 0. As n increases from —~%™, P(n) increases 
steadily from — ©, equals 7’ at n = O, crosses the n-axis atn = N, 
turns downward somewhere beyond n = N, and approaches the n-axis 
asymptotically from above as approaches + ©. 

Cases IV and V are easily summarized as follows: If A = 6 (corre- 
sponding, as indicated at the end of Section 2, to the absence of the 
good-will factor), then the only policies in contention are 100 percent 
testing (n = N) and no testing (n = 0), and 100 percent testing will be 
more (less) profitable than no testing according to whether 


Cyaan (24) 


It seems of interest to note the resemblance of (24) and (8). 

Cases III and V are summarized by: If 7 + NS < 0, it is most 
economical to test 100 percent. 

For Case II, the most economical sample size is the n between 
—T/S and N at which P(n) turns downward. This case is of interest 
since it establishes the possibility of an optimum sample size other 
than 0 or N. This possibility has already been noted in [3]. 

For Case I, the most economical sample size is either n = 0 or the 
n at which P(n) turns downward, depending on the relative magnitudes 
at these two sample sizes. Note that, for T = 0, the n at which P(n) 
turns downward must be greater than zero, so that, as in Case II, the 
most economical sample size will be other than 0 or N. 


6. EXAMPLE AND CONCLUSIONS. 


Consider the case when good-will is not a factor, i.e. the case of 
constant b(k). In this case both the formulation of Section 4 and that 
of Section 5 imply that only the two policies of zero and 100 percent 
testing are in contention, the choice between these depending on the 
direction of a simple inequality. 

Defining the critical testing cost 1, to be the testing cost for which 
zero and 100 percent testing are equally economical, it seems of interest 
to compute 7, for both formulations, using comparable value figures. 
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A typical value for a non-reactor in a fully tested flock is $3, a 
typical average value of a member of an untested flock is $2.40, and 
typical values for c, z and N are $.50, 10°° and 500. 

Computing 7, in the spirit of Section 4, we therefore set A = B =$3 
and a = $2.40. Replacing 1 — (1 — 7)” by Nz (allowable since Nx 
is small), criterion (19) then becomes $3 X 107° > (<) $2.5 X 107° + 4, 
which means that 7, = $0.003, i.e. that no testing is most profitable 
unless the cost of testing falls below 3 mills per bird. 

Computing 7, in the spirit of Section 5, we set A = 6 =$3. In 
addition, we interpret $2.40 to be the value of the linear function a(k) 
evaluated at k = Nz, the expected number of reactors in the flock. 
This implies a per-bird disaster value of y = —$6 X 10* for an un- 
suspected 100 percent infected flock. Criterion (24) then becomes 
$0.6 > (<)z, which means that 7, = $0.60, i.e. that 100 percent testing 
is most profitable unless the cost of testing rises above 60¢ per bird. 

In practice, the cost of testing is approximately seven cents per 
bird. Since this cost is well bracketed by the critical costs 0.3¢ and 
60¢ derived above, we learn that the shape of the value function a(k) 
[and of course also that of b(k)] must be determined rather accurately 
if the methodology presented here is to be applied. 
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NUMERICAL ASPECTS OF THE REGRESSION 
OF OFFSPRING ON PARENT’ 


H. E. McKean anv B. B. BOoHREN 
Population Genetics Institute, Purdue University 
Lafayette, Indiana, U. S. A. 


INTRODUCTION 

In an earlier paper (Bohren, McKean, and Yamada, [1961]) three 
currently employed techniques for estimating the regression of off- 
spring on parent, and thereby heritability in the narrow sense, were 
compared and contrasted with respect to their efficiencies of estimation. 
The general conclusion, based on theoretical considerations and an 
empirical study of five generations of a closed poultry flock (Yamada, 
Bohren, and Crittenden, [1957]), was that under the circumstances 
considered, the method of regression of offspring means on parent’s 
record (method 1) was inferior to the method of regression of individual 
offspring on parent (method 2) (with the parent’s record repeated once 
for each of its offspring) and to (method 3) the Kempthorne-Tandon 
technique (Kempthorne and Tandon, [1953]). 

The success of the Kempthorne-Tandon technique depends upon 
knowledge of a parameter p, the correlation between deviations of two 
offspring of the same parent from the predicted breeding value of the 
parent, and its expected superiority over the second method depends 
upon the magnitude of p. Usually p is guessed in the light of prior 
knowledge, and weights are assigned to the families according to the 
guessed value of p. In the first paper it was shown that, under the 
assumption of all genetic variance being additive, p < .067 or < .079, 
depending upon whether the mating structure is random or hierarchal. 

The results obtained in the previous paper specifically depended 
upon the particular distribution of family sizes encountered in the five 
analyses, and upon the accuracy of the estimated values of T=p/(1—p). 
The purposes of this paper are to consider the efficiency loss incurred 
by mis-guessing p in the Kempthorne-Tandon technique, and to in- 
vestigate the factors involved in the relative efficiency of the other 
two methods. 


THEORY 


We consider a breeding experiment in which s sires are selected 
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from the population, sire 7 being mated to a random sample of d; distinct 
dams. The mating structure is then hierarchal, with the progeny of 
sire ? having phenotypic values given by the model 

Vij. = ae + B(X;; == A) Chik , (1) 
which is equation (11) of the previous paper. 

It has been pointed out that the three methods under consideration 
are merely special cases of the general unbiased weighted regression 
estimation procedure. The difference between the methods involves 
only the weights applied to each progeny-group deviation: (1) 40,0" 
for the progeny means on parents technique, (2) w;; = n;; for the 
repeated parents technique, and (3) w,;; = n,;/(1 + n,;T) for the 
Kempthorne-Tandon technique, where 7 is a guessed value of 7. If 
t = T, the third technique is the minimum variance technique, whereas, 
if + = O, the third technique reduces to the second. F urthermore, 
when the family sizes are equal (ni; = niz = --- , for all 2), all three 
methods are identical in efficiency. Since most experimental data 
will involve unequal family sizes, interest will center on considering 
this situation. 

The question, ““When may I use method 1 with little loss in effici- 
ency?’’, is a pertinent one. First, let us examine the optimum choice 
of weights for which the variance of the estimate of 8 will be minimized. 
These optimal weights w* (say), where w* = n;,;/(1 + n,;T), will be 
approximately equal (hence method 1 appropriate), for 7’ > 0, under 
one or more of three distinct sets of circumstances: 

1) All n,; are large. This follows immediately from 


li | tr | == 

goes 1 = n,;T cs fh 
2) SS , the variance of the family sizes, is zero or very small. 
3) T is large. This follows from the fact that 


* 
wi, 
Hite al, 
To Wir; 
(or p>1) 


independent of n;; and ;-;, ; thus for large 7, w* = w*,,,. In view of 
this, method 1 may also be considered as a special case of method 3 
where 7, the guessed value of 7’, is allowed to approach o. 

It is of interest, therefore, to determine a readily accessible statistic 
which depends upon 7% (the average dam family size), S* , and 7’, upon 
which a decision to use or not use method 1 as opposed to method 2 


may be based. . 
It is easy to show that the coefficient of variation among the optimal 
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exp (xl(q — pz)” — I}, \ Oe ee Oe De aU, Gale pe (1) 


It is believed that most of the deviation of the distribution of the 
survivors from the simple distributions mentioned above stems from (i) 
the complex structure of survivors within an egg mass and (ii) the 
movement of survivors from place to place. Since the Negative Bi- 
nomial distribution has been found to be very useful in fitting data 
involving this type of heterogeneity, it is reasonable to suppose that 
the Poisson Pascal will give a better description of the population of the 
survivors in a field. 

The Poisson Pascal distribution can also be looked upon as the 
result of compounding a Pascal distribution with p.g.f. (¢ — pz)" by 
taking k, to behave as kx where k is a positive constant and x a Poisson 
random variable with p.g.f. exp {A(2 — he 


3. SOME PROPERTIES OF THE POISSON PASCAL DISTRIBUTION 


The limiting forms that the Poisson Pascal distribution takes as the 
parameters take on extreme values are given in Table 1. It is to be 


TABLE 1 


Some Limitina Forms oF THE PoIssON PascaL DISTRIBUTION 


No. Limits Taken Name and p.g-f. of the limit 
1 k—> ©,p—-9 Neyman Type A, exp {Mexp (u(z — 1) — 1}} 
pk = 
2 k—-0,’\7> © Negative Binomial, (¢ — pz)": 
rk — ky 
3 p—70,+7 © Poisson, exp {M(z — 1)} 
Akp = Mt 


noted that the Neyman Type A and the Pascal distributions are among 
the limiting forms. 

The flexibility of the Poisson Pascal was compared quantitatively 
with that of the Neyman Types A, B, C, Pascal, and Poisson Binomial 
by evaluating the relative skewness and kurtosis of each for fixed 
mean k,p,; and variance k,p,(1 + 7x) using the indices x;3;/kip; and 
ka;/kip’ respectively along the lines of Anscombe [1]. The range of 
numerical values of these indices for each of the foregoing distributions 
is shown in Table 2. It is apparent that the Poisson Pascal covers the 
entire range of distributions from Neyman Type A to Pascal with 
respect to skewness and kurtosis. Also, the ranges of these ratios for 
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TABLE 2 


CoMPARISON OF SKEWNESS AND KurTosIs oF CERTAIN DISTRIBUTIONS 


No. Name Range of Skewness Range of Kurtosis 
1 Neyman Type A 1 1 
2 Neyman Type B 9/8 27 /20 
3 Neyman Type C 6/5 8/5 
4 Pascal 2 6 
5 Poisson Pascal (1,2) (1,6) 
6 Poisson Binomial (0,1) (0,1) 


the Poisson Pascal and the Poisson Binomial are disjoint. Since their 
p.g.f.s have the common form 


exp {A[(¢ = pz)" =“1]};, (2) 
we observe that the distribution with (2) for p.g.f. wherein X > 0, 
q=1+p,p > O0whenk > Oand —1 < p < 0 when k is a negative 
integer, covers a very wide range of distributions. As an aid in com- 
puting the values of the ratios for the samples to obtain an idea as to 
how close the sample distribution is to the various distributions men- 
tioned in Table 2, formulae to compute the factorial cummulants «;,, 
using sample moments are given in Appendix A. 

Since the first two frequencies were large in the sets of data to 
which this (and similar) distributions were fitted, the ratio of the first 
two frequencies were compared for some of these distributions. It 
can be easily shown that the value of this ratio for the Poisson Pascal 
distribution lies between the ratios for the Neyman Type A and the 
Pascal distributions. The ratios are given for brevity in Table 3. 


TABLE 3 


CoMPARISON OF THE RATIO OF THE FIRST TWO FREQUENCIES WITH MEAN 
AND VARIANCE FIxep AS kiP; AND kiPi(1 + pi) 


No. Distribution Ratio of Frequencies 
1 Neyman Type A kip: exp (—71) 
2 Neyman Type B 2k, {1 — exp (—3p1)(1 + 3p1)}/(pim) 
- 3 Neyman Type C 3k1{2p1 exp (—4pi) + 2[—2 exp (—4p:) 
+ 64p? + 4p.)}/4p3 
4 Pascal kypi(l + pi) 
5 Poisson Pascal kupi(l + pi)/(& + 1)-* 


re ee eS eS SSS ee 
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4. FITTING POISSON PASCAL AND EFFICIENCY OF 
METHODS OF ESTIMATION 


Since obtaining maximum likelihood estimates is very cumbersome, 
ad hoc methods were used to estimate the parameters. When the mean 
and the variance were large, use was made of the method of the first 
three moments. When they were moderate and the proportion of the 
zero frequency large, use was made of the method of the first two 
moments and the proportion of the zero frequency. When the first 
two frequencies were large in comparison with the remaining, the 
method of the first two moments and the ratio of the first two fre- 
quencies was used in estimation. The equations for estimation are 
given in Appencix C. The fit of this distribution to the data of Beall 
and Rescia [2] are given in Tables 4 and 5. For the sake of reference, 
the fit of a generalization of the Neyman Type A as given by the authors 
of these data are also given alongside. The relatively good fit of the 
Poisson Pascal is apparent. 


For obtaining a comparison of the various methods of estimation, 
it was decided to compute the efficiency function 


‘TABLE 4 


Fir oF THE OBSERVED FREQUENCY oF LespEDEZA CapiraTa, TaBLEe V oF [2] 


Expected frequency 
Observed due to Poisson Pascal Expected frequency 
Plants Frequency (Method of Moments) as in [2] 
0 7178 7185.0 (PAWS 
1 286 276.0 218.6 
2 93 94.5 105.5 
3 40 41.5 50.9 
4 24 20.2 24.5 
5 cf 10.4 11.8 
6 5 5.6 A 
7 1 Sel 2.8 
8 2 il sé LES, 
9 1 1.0 6 
10 2 6 o 
i+ 1 3 A 
5 =a 9.58 42.97 
Degrees of 
Freedom — 8 9 
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E = 1/(Generalized variance X Information determinant) 
(cf. Cramer [4], pp. 489-497). (3) 


If we denote the parameter vector (A, p, k) by (A, , Xe , As) and the 
set of statistics used by (t, , ts , t,), we get the expression for the general- 
ized variance of the estimates as 


TABLE 5 


Fit oF OBSERVED FREQUENCY OF LEPTINOTARSA DECEMLINEATA, 
TaBLE III oF [2] 


Expected frequency due to 
Poisson Pascal, (method of Expected 
two moments and first frequency 
Insects Observed frequency as in [2] 
0 33 33.0 39.5 
1 12 9.8 6.0 
2 5 Ce 4.9 
3 6 5.5 3.4 
4 5 4.0 3.2 
5 0 2.9 2.5 
6 Pe Daal 2.0 
i 2 Ls 1.6 
8 2 Wal iS 
9 0 8 1.0 
10 1 6 8 
11+ 2 ile 3.3 
x? — 6.88 NBS 
Degrees of 
Freedom — 8 9 


2 


G a | V(t, , te ,ts) /| Ott» Ta 7) ) (4) 


A(Ay »Y2, Xs) 


where 7; , 72 , 73 are the functions of ), , \, and A; estimated consistently 
by 4,,%,¢. A proof of this is given in Appendix B. Since evaluating 
the covariance matrix V(¢, , t, , ts) and the derivatives of t, , t2 and ¢; 
follows from the regular statistical techniques, no elaboration need be 
made here. A formula for the information determinant (cf. Shenton 
[10]) is 
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Ldpeors leaPpeors 1 OP, oP, 
Lp. Soh gE P. oo. pe Peon one 
ee 1 OP, oP. 1 OP, OP; ‘1 OP, oP; . 
wes Lp. aan Lp. oe Lp. ero (5) 
suk aP, oP, 41 aP,8P, aP, oP, 
2 P.Dry Oy Pz Org Or 4’ Pz OAs OAs 


The principal problems in evaluating [ for a value of (A, p, &) therefore 
are (i) to evaluate the various P, and the derivatives of P, and (7) 
to determine the number of terms to be used in summing the infinite 
series. To obtain P, , let 


g(2) = exp {A(q — pz)” — 1} (6) 
and 
h(z) = (q — pe)". 


By differentiating (6) successively we get 


g'(2) = \g@h'@), (7) 
and 
go @) = x ys ig et Ne @Aer-@, (8) 


Set z = 0 in equations (6), (7) and (8) and observe that gq Olas 
and h‘”(0) = x! 7, where 


eer iene )e z—k—-z 
Pi payee REG ae (9) 


is the probability of z in the Pascal distribution with h(z) as p.g.f. 
Then we have the recurrence formulae 


Pyercxpai niga all (10) 
and 
pee abs oN =o iP, (11) 


which can be repeatedly used to evaluate any P, . 


The various derivative are given by 


oP, Ve y F 
== oe = ‘3 ee = ae [g(2(h(e) — wy} 


Eis 2 & 
aes +E DP.2 (1 + ry, (12) 


z=0 
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Pe Pe 118 | 2 | elo Ene 
ON: op —x!- de" ap 9@) So ecel lee me Pe 


(a@P,— (@+1)P.} (18) 


oPs OP 1 fa [2 ] 
O43 ok cz! loz? Lar (2) vis, 


ee & [—Ag@h@) log (¢ — pa 


x! 
Pls Pee ker 
= ip 2 0/0" | (ae — t+ UP. cas — v@ — OP. 1} 


1 
5 (log g){q@ + I)P.s1 — prP,} (14) 
for all x. 


TABLE 6 


EFFICIENCY OF THE Mrtuop oF THE First THREE MoMENTS FOR 
THE Poisson PascaL 


k 
N Pp 1 3 5 1.0 2.0 
1 84 82 82 81 76 
am 3 59 58 58 54 AT 
“a 5 45 44 43 40 33 
i 1.0 26 25 24 uP 18 
a 2.0 13 12 12 12 13 
5 at 90 81 82 HG 67 
5 3 59 58 .56 49 35 
5 5 46 44 Al 34 22 
5 1.0 26 25 23 18 10 
5 2.0 13 438 “2 = oe 
1.0 a 81 83 82 75 63 
1.0 3 59 59 55 46 28 
120 5 46 44 .40 31 15 
1.0 1.0 27 25 iz 15 05 
1.0 = = = — oe = 
5.0 1 94* .99* .78* 58* 21* 
5.0 3 62* .56* AL* 11* 01* 
5.0 5 4* .38* .20* 04* 00* 
5.0 1.0 29* 17* 06* 02* 00* 
5.0 = == = = = = 
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To determine the number of terms we use the following rule: 

Let 7,;(n) denote the (n + 1)th term in the series involved in the 
(, j)th term of matrix (5). Let S;;(n) = Dot%-o T(r). Compute 
S;,(n) and Die, Pir) Si) ton ak - et cetera successively 
till a value of 1 is reached for which 


Dd Tii(n)/Sii(n) < 10. (15) 


bo 


It is clear from (15) that | 7;;(n) |/| S:;(n) | <10~™ for each 2 and j. 
If the series converge faster than a geometric series with common 
ratio less than 0.9 and this convergence starts before the value of n is 
reached for which (15) is satisfied, the calculated efficiency will be 
correct to three significant figures. If the significant figures do not 
cancel out, this should yield the efficiencies computed therefrom, correct 
to three decimal places. When the inequality (15) was not satisfied 
for values of n < 20, the partial sum of the first twenty terms was 
taken as the value of the series since evaluating the terms of the series 


TABLE 7 


Erricrency oF THE MretTuHop oF THE First Two MoMENTS AND THE FIRST 
FREQUENCY FOR THE Poisson PascaL DISTRIBUTION ATA = 0.1 


k 
N p 1 3 5 1.0 2.0 
al a 99 98 98 98 98 
7 3 93 94 95 94 93 
el 5 90 90 91 91 89 
a 1.0 82 83 84 84 83 
A 20 74 76 78 84 78 
5 a 1.00 1.00 98 96 95 
5 3 93 94 93 92 87 
5 5 91 91 90 89 82 
5 1.0 83 85 .85 82 73 
5 2.0 75 78 81 = a 
1.0 v1 1.00 99 97 96 94 
1.0 ig 94 96 94 93 87 
1.0 5 92 92 91 98 81 
1.0 1.0 84 85 86 82 72 
1.0 2.0 88* 87* .87* ie 83% 
5.0 zi - = .99* .89* 98* 
5.0 3 97* 99* 1.00* 1.00* 98* 
5.0 5 95* .97* .98* 99* 96* 
5.0 1.0 .90* .96* 97* a — 
5.0 20) .89* = ae 2s = 
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for n larger than 20 is very time consuming. The efficiency when n 
was restricted to 20 is marked with an asterisk to indicate that they 
are less likely to be correct to three decimal places. When the efficiency 
so computed was larger than one (due to the inaccuracy in computing 
the information determinant), the corresponding cell in the efficiency 
table is left blank. 

The efficiency of the method of moments is given for certain values 
of (A, p, k) in Table 6. The efficiency of the method of the first two 
moments and the first frequency is given in Table 7 and that of the 
method of the first two moments and the ratio of the first two fre- 
quencies in Table 8 for the same values of (n, Dp ky: 

It is apparent that the method of the first two moments and the 
ratio of the first two frequencies has high efficiency and is superior to 
the other two methods when \ is small (and consequently the first 
two counts account for a large proportion of the observed frequencies). 
Also the method of the first two moments and the first frequency is 
highly efficient when is moderately large. When A, p or k approaches 


TABLE 8 


EFFICIENCY OF THE Metuop oF THE First Two MoMENTS AND THE RatTIo or 
THE First TWO FREQUENCIES FOR THE Poisson Pascal ATX = 0.1 


k 

x p 1 3 5 1.0 2.0 
a a 1.00 1.00 1.00 1.00 1.00 
ok 8 98 99 .99 99 98 
a 5 99 98 .98 97 94 
1 1.0 94 94 93 91 83 
ot 2.0 87 88 .89 91 99 
5 a 1.00 99 1.00* 1.00 99 
5 2 98 1.00 .99 98 96 
5 1.0 99 98 97 96 92 
5 2.0 94 94 93 89* 80* 
1.0 | 98 1.00 1.00 1.00 99 
1.0 3 99 1.00 99 99 95 
1.0 5 1.00 99 .98 96 91 
1.0 1.0 95 94 93 88 79 
1.0 2.0 88* 89* 91* a = 
5.0 x = _ .99* .95* 91* 
5.0 “B 1.00 99* .98* 89* 76* 
5.0 5 1.00 96* .91* 80* 65* 
5.0 1.0 95* .88* .79* 84* 1.00* 
5.0 2.0 90* .98* = = =f 
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infinity, it can be shown by calculus that the method of moments is 
much superior to the other two but since the efficiency of each of these 
methods tends to zero for such values, this has little significance. 


5. CONCLUSIONS 


On the basis of the properties discussed in Section 3 and the fitting 
in Section 4, we observe that the Poisson Pascal distribution acts as a 
bridge between the Neyman Type A and the Negative Binomial dis- 
tributions and may be used with advantage when the latter distribu- 
tions are inadequate to represent the population accurately. 

From the tables of efficiency, it is clear that in the region of tabula- 
tions, at least one of the ad hoc methods of estimation suggested above 
has high efficiency. It is believed that in practice, (A, p, &) will not 
be far beyond the region of tabulation and that one of these methods 
can be used without too much loss of information. Techniques for 
choosing one of the many ad hoc methods on the basis of the sample 
will be discussed in a future paper. 
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APPENDIX 


A. Formulae to compute factorial cumulants using moments about the origin: 


We first obtain formulae to compute the first four factorial moments 
ts) ,2 = 1, «++ , 4 using moments about the origin y; and then obtain 
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formulae to evaluate the first four factorial cumulants «;;; 2 = 1, --- ,4 
using these factorial moments. 

As for the first objective we note that u,;;=H{x(w@—1)---(@—-7+1)}, 
i = 1,2,---. By expanding the product within the expectation sign 
and using the elementary properties of the expectation operator, we get 


ny = as Miz) = M2 — ML, Mis) = Ms — 3ua + Qui, 

and 

te = be le ys Oy (16) 
As for the latter, we observe that if w(t) and y(t) denote the factorial 
moment generating function and the factorial cumulant generating 
function, then y(t) = log u(é). On differentiating the equation 
successively with respect to ¢ at ¢ = 0 and noting that K,.) = vy? (0), 
we have 


Kay = Bl > ie — (Se) iin) , 
Kish ta MES ly map Se iihi2) Quin ’ 
and 
Kia) = Mi4) 3ui21 — 4pjHis1 1 Quin Mi21 = bein : (17) 


B. To prove formula (4): 

Since 7; , 72 , 7s are functions of ), , A, and A; , let us write them more 
explicitly as 71(A1 , Az , Ag), Te(Ar y Az As) and 73 (Ai, Az , As) respectively. 
If \, , Xe, As are the estimates of A, , Ax, As then using the statistics 
t, , tg , ts Which estimate the 7’s consistently, we have 

t; = Oe Be hr Ue 2 ios 
Hence, we have the asymptotic relations 
OT; OT; OT; et 
al oe Wawa’ Sipe a =) og alae 


which can be rewritten as 


ye = Ay A aa SCA 
O(71 T2 9 oi 
= = | SS te = To | 
Na — Ys Ee KS eX) Zone 
x — hg ly — 73 


The generalized variance G of hi , As, As is then given by 
G= | HA, —-d Fe ea — ds)/Ar = Ay SNe — o,Asz — 3) | 


O(m1 _» Te » Ts) 
= 1 V6, 4,6) |/ | ees 


which is formula (4). 


2 
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C’. Equations for estimation: 

We give below the equations for estimating the parameters for the 
various methods mentioned in Section 4. Their derivations are omitted 
for brevity. 


i. Equations for the estimation of parameters using the first three 
moments are: 


k = ——*wtn _, _ 9 (18) 
Riaikiay = (Rio) ‘ 

P = Ria/(Ri(k + 1), (19) 

N = Ru) /(kp). (20) 


u. Equations for the estimation of parameters using the first two 
moments and the proportion P, of zeros are: 


p log {1 + (Si = ») log Pal a (Se _ ») log’ (lens (21) 


Ki1) Ray Roxy 


k=—@2l 4, dX = &u;/(hp). (22) 


i PR ia) 
il. Equation for the estimation of the parameter p using the first 
two moments and the ratio P;/P, of the first two frequencies is 


log (1 + p) ert 
P Ria) 


k and ) are then estimated by using equations (22) and (23). 


log fener (24) 


SOME RANK SUM MULTIPLE COMPARISONS TESTS 


Rosert G. D. Stern 


Institute of Statistics, North Carolina State College 
Raleigh, North Carolina, U.S. A. 


1. SUMMARY 


Rank sum tests for multiple comparisons and suitable to the 
completely random design with equal sample sizes are discussed. 
Significance tables and some facts about the joint distribution of rank 
sums are given. An example illustrating test procedures and making 
use of the tables is presented. 


2. INTRODUCTION 


A number of nonparametric tests based on ranks have been pro- 
posed for the comparison of treatments in a completely random design. 
For example, we have the Wilcoxon-Mann-Whitney test [21, 10], 
basically a two-sample test with a per-comparison error rate. 

Also, Kruskal and Wallis [9] have proposed a rank test which is an 
analogue of Snedecor’s F-test. This test provides evidence concerning 
the presence of real differences but is of limited use in locating them. 

Steel [16, 17] has presented rank tests for comparing treatments 
against control and for all pairwise comparisons. Both of these tests 
use experiment-wise error rates. 

Pfanzagl [13], as part of a more general theory, has discussed a 
two-step nonparametric decision process based on ranks, for testing 
the null hypothesis that k samples come from the same population and, 
if this is rejected, for deciding which one of the samples comes from a 
different population. No tables are given but it is suggested that they 
might be obtained by random sampling. It is also shown that the 
limiting distribution of the multivariate criterion is multinormal. 

The per-comparison error rate test is sometimes criticized, par- 
ticularly when all possible paired comparisons are made, because it 
will almost certainly lead to false declarations of significance when the 
experiment includes many treatments and if customary significance 
levels are used. It is also deemed inappropriate when the experiment 
is considered to be the conceptual unit. 

The experimentwise error rate test is sometimes criticized because it 
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requires such a large difference to be declared significant that it becomes 
difficult to detect any but the largest real differences when customary 
significance levels are used. Also, it may be that the individual com- 
parison is considered to be the conceptual unit. 

A brief discussion of these error rates is given by Steel [18] in re- 
sponse to Biometrics Query 163. 

Choice of a definition of error rate in the conduct of a particular 
experiment seems somewhat less crucial when it is realized that we can 
compute the significance level for a particular definition of error rate 
from knowledge of the chosen significance level for any other definition 
of error rate. This is not generally a simple computation unless the 
comparisons are independent. In the case of p independent comparisons, 
if a’ and @ are the experimentwise and per-comparison error rates 
respectively, we have the relation: 1 — a’ = (1 — a)’. 

When comparisons are not independent, computation of comparable 
significance levels for different definitions of error rate depends upon 
the extent of the dependence and the nature of the multiple comparisons 
test. No individual is likely to perform such a computation for a 
single experiment. Thus a table needs to be prepared for comparing 
the customary significance levels for differently defined error rates. 
For tests based on an underlying normal distribution, this has been done 
fairly extensively by Harter [4, 6]. 

The experimenter may try to meet the usual criticisms of per- 
comparison and experimentwise error rates by choice of a non-standard 
significance level or an alternative test procedure. Presently, tables 
of significant values for such levels do not appear to be available for 
experimentwise error rates; in the case of alternative tests, several are 
available, including the Newman-Keuls [11, 7] procedure and Duncan’s 
[1,5] new multiple range test, which are sequential in nature. 

This paper is concerned with rank tests, in particular with tables for 
an all-pairs-of-treatments test with an experimentwise error rate, and 
the use of these tables for a fixed rank sum test, an analogue of Tukey’s 
w-procedure (for example, see Steel and Torrie [19]), and for two multiple 
rank sum tests, analogues of the Newman-Keuls procedure and of 
Duncan’s test. Table 2 is used in the first two cases, Table 3 in the 
last case. 


3. CONSTRUCTION OF TABLES 


The proposed tests call for rank sums and their conjugates computed 
for all pairwise comparisons of treatments. The minimum of each 
rank sum and conjugate is used, the set of minima providing a multi- 
variate rank sum test criterion. These sums are compared with a single 
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tabulated value for the fixed rank sum test and with several values for 
the multiple rank sum tests. Table 2 provides critical values for the 
analogues of Tukey's and the Newman-Keuls tests; Table 3 provides 
for the analogue of Duncan’s test. 

Methods for constructing probability tables and limited tables have 
been presented earlier [16, 17]. Construction of exact tables of any 
extent was beyond the computing facilities available. However, some 
machine time was available and this was used for some sampling ex- 
periments. 

It was originally intended to ignore the discrete nature of the data 
and to use the Kolmogorov-Smirnov [8, 14] one-sample test to determine 
the sample size necessary to attain a certain precision in the constructed 
tables. However, available computing facilities limited sampling to 
values of k = 3 and 4 (2 and 3 treatments when one was control) and 
n = 4 (1) 10. In addition, samples were obtained for k = 5, n = 4, 
5 and k = 6, n = 5. The number of permutations obtained for each 
case was either 5000 or 6000. These tables were used only for checking 
purposes against the few exact distributions available, k = 3 and 
n = 3 (1) 6, and against approximations used in constructing these 
and earlier tables. 

It was assumed that the various multivariate rank sum criteria 
are distributed approximately as multinormal distributions having 
mean vectors and variance-covariance matrices as given in the appendix. 
(Fraser’s [3] vector form of the Wald-Wolfowitz-Hoeffding theorem 
does not apply since the || C,.(z, j) ||’s of Fraser do not exist for the 
test criteria used here.) 

On this assumption, one naturally proceeds to base computation 
on presently available tables. Tables for Tukey’s and Duncan’s tests 
are the obvious choice for all-pairs tests. These tests are based on a 
multinormal distribution with p? = n,n;/(n, + n;)(m + n;) where the 
present distribution calls for p = nn/(mtn +tiimtn + 0, 
a small difference. The appropriate tables are, then, tables of the 
Studentized range with known variance, that is, infinite degrees of 
freedom. Table 22 of Pearson and Hartley [12] is such a table, gives 
percentage points of .10, .05 and .01, is appropriate for the first two 
tests, and was used in computing Table 2. Corrected tables for Duncan's 
test have been computed by Harter [5] and this table was used in 

computing Table 3, also for percentage points of .10, .05 and .01. 
| Table 2 was constructed by taking the integral part of u — tc/ / 2, 
unless the decimal fraction was > .9, in which case the next higher 
integer was tabulated, where ¢ was obtained from the distribution of 
w/o, w = range, Table 22 of Pearson and Hartley [12]. Since rank 
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sums are essentially differences, it is necessary to introduce 1/2 into 
the denominator as shown. Tabulated rank sum values for a = .10 
differed in only two cases from values obtained by sampling. In par- 
ticular, fork = 4, = 4, no value is significant by sampling; for k = 3, 
n = 6, a rank sum of 25 is significant by sampling whereas 26 is not. 
Values for a = .05 differed in no case. Values for a = .01 ran lower 
than those obtained by sampling, the difference increasing with n to a 
value of two in three cases. Hence, it is reasonable to conclude that 
tabulated values of the rank sum are conservative (low) for a = .01. 

The first attempt to construct Table 3 led to values which tended to 
run high for a = .10, correct for a = .05, and low for a = .01, relative to 
values found by sampling. For this reason, tabulated values are of 
u — to//2 decreased by unity for a = .10, as computed for a = .05, 
and increased by unity for a = .01. On this basis, tabulated values 
for a = .10 appear to be low, hence conservative, when not in agreement 
with sampling results; in particular, 7 out of 19 tabulated values are 
one unit low. For a = .05, 3 out of 19 values are one unit high. For 
a = .01, 5 out of 19 values are one unit low, with three of these being 
for n= 5. 

Tables have already been constructed [16], using Dunnett’s [2] 
tables, for the treatments against control test. These tables agree well 
with the sampling results. In no case is there a difference of more 
than one in the value of the test criterion, with the tables most often 
giving the conservative (lower) value. 


TABLE 1 


FINAL WEIGHTS oF CuIcks aT Srx WEEKS (GRAMS) FOR VARIOUS 
SOURCES OF PROTEIN SUPPLEMENT 


H L Sb Sf M 

Horse- Linseed Soybean Sunflower Meat C 

bean Oil Meal Oil Meal Seed Oil Meal Meal Casein 
179 309 243 423 325 368 
160 229 230 340 DIST 390 
136 181 248 392 303 379 
227 141 327 330 315 260 
217 260 329 341 380 404 
168 203 250 226 153 318 
108 148 193 320 263 352 
124 169 271 295 242 359 
143 213 316 334 206 216 
140 257 267 322 344 222 
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4. USE OF TABLES 


To illustrate the use of the tables, the data in Query 60 of Biometrics 
(15) are used. These are presented in Table 1. Since the test is presently 
unavailable for unequal sample sizes, only the first ten items in each 
treatment are used. 


The following set of minimum rank sums is obtained by pairwise 
rankings, a minimum being the lesser of the rank sum T, and its conju- 
gate, T’ = (2n + 1)n — T; minimum treatment is the treatment for 
which the rank sum is minimum. Ties were assigned their average 
rank. This gives a multivariate criterion with 15 entries. 


Comparison H, Sf H, Sb JEL AG! L, Sf HH, M TAG 
Minimum Rank Sum 56 57 58 60 62 643 
Minimum Treatment H H H L H L 


Comparison Sb, Sf H,L L. Sb L,M 8b,C Sf,M M,C 8b,M Sia) 
Minimum 


Rank Sum eal 75 75 754 80 82 84 100 103 
Minimum 
Treatment Sb H L L Sb Sf M Sb Sf 


From Table 2, a rank sum of 67 is significant at the 5 percent level, 
k = 6, = 10; hence six comparisons are declared significant. 

The device, used with multiple comparisons procedures, of under- 
lining treatments which cannot be distinguished by their means may 
be adapted to apply to rank sum procedures. Thus, from the test, 
it appears that H and L, and L, Sb, M, Sf and C form two groups 
as a first step; since L can be distinguished from Sf and C, the latter 
group becomes two, namely L, Sb and M, and Sb, M, Sf and C. We 
have: 


H L Sb M Sf C 


Ordering of Sb and M, and of Sf and C was done on the basis of rank 
sums for these paired comparisons though this does not imply that 
this is the only, or even the best, method. 

This procedure is an analogue of Tukey’s w-procedure [19]. The 
significance level is for an experimentwise error rate. Use of Wilcoxon’s 
[21] two-sample test, with its per-comparison error rate, calls for a 
significant rank sum of 78; this will result in four more comparisons 
being declared significant. 
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TABLE 2 


PERCENTAGE PoINTs oF THE MINIMUM RaANnkK SuM 
(AN APPROXIMATION ) 


Number k = number of treatments being tested 

in a ee 

treatment 3 4 5 6 7 8 9 10 

10 10 10 — =o = = a = 

4 05 = = == —— — — = a 

O1 — = = = — == = = 

10 17 16 15 15 = == a —= 

5 05 16 15 = == = = =F = 

Ol — — = = = = = = 

10 26 24 23 22 22 21 Pj — 

6 05 24 23 22 21 — — — == 

O1 -- — = = = = = <= 

10 36 34 33 32 31 30 30 29 

u 05 34 32 31 30 29 28 28 — 

01 29 28 — — — — — — 

10 48 46 44 43 42 41 40 40 

8 05 45 43 42 40 40 39 38 38 

Ol 40 38 37 36 — — — —_— 

10 62 59 57 56 55 54 53 52 

9 05 59 56 54 53 52 51 50 49 

Ol 52 50 48 47 46 45 — — 

.10 77 74 72 70 69 68 67 66 

10 05 74 7h 68 67 66 64 63 63 

OL 66 63 62 60 59 58 57 56 


It is also possible to propose and carry out a sequential procedure, 
an analogue of the Newman-Keuls procedure, which uses several rank 
sums for testing. For this procedure, the above analysis is the first 
step and has separated the treatments into three groups. To proceed, 
we assume that declared differences are indeed real. Hence to test 
H versus L, the first group, we may use Wilcoxon’s [21] two-sample test, 
H and L are declared significantly different and the line beneath them 
may be removed. 

Further, compare L, Sb and M using the critical value for k = 3, 
n = 10, namely 74. L versus Sb and L versus M at 75 and 75} are be- 
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yond the 10 percent point but are not quite significant. We cannot 
distinguish among these three treatments. 

Finally, consider the group composed of treatments Sb, M, Sf and C. 
The critical value is 71, k = 4,n = 10. The treatments Sb and Sf can 
be distinguished and we must, then, change the order of Sf and C from 
that proposed when the fixed rank sum test was used. No further 
differences will be declared significant by this procedure. We have: 


H L Sb M C Sf 


For the Tukey and Newman-Keuls parametric procedures, we find 
means of 160.2(H), 211.0(LZ), 267.4(Sb), 278.8(M), 323.2(Sf) and 
326.8(C). Also s; = 18.03 and significant ranges are 51.2, 61.5, 67.6, 
71.9 and 75.2 fork = 2, --- , 6 respectively. 

For Tukey’s test, the fixed rank sum is 75.2. We find: 


H L Sb M Sf C 


For the Newman-Keuls procedure, we find: 
H L Sb M Sf C 


We now compare the results obtained from applying the parametric 
and non-parametric procedures. 

The fixed rank sum test and Tukey’s test lead to the same con- 
clusions. Both are based on experimentwise error rates. 

Conclusions drawn from the multiple rank sum test and the Newman- 
Keuls test differ as follows. JZ versus M is significant by the Newman- 
Keuls procedure only; Sb versus Sf is significant by the rank sum test 
only. Otherwise, the procedures lead to the same conclusions. Fifteen 
paired tests have been made. Since L versus M is nearly significant 
and Sb versus Sf is just significant by the multiple rank sum test, it 
would appear that the two methods lead to conlcusions, for this ex- 
ample, that differ only slightly. 

The other multiple rank sum test to be considered is an analogue of 
Duncan’s new multiple range test. Table 3 provides critical values. 
A rank sum of 75 is significant at the 5 percent level, k = 6, n = 10; 
hence nine comparisons are significant. Tentatively, we have: 


veil, ed et) el ae ORS 


Unfortunately, this includes an anomaly since Sb versus Sf is also de- 
clared significant. The same would be true if the Wilcoxon-Mann- 
Whitney test were being used at a significance level (between 5 percent 
and 1 percent) calling for a critical value Of (oO: 
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TABLE 3 


PERCENTAGE Points oF THE MinImMuM RANK SuM FOR DUNCAN ANALOGUE 
(AN APPROXIMATION ) 


Number k = number of treatments being tested 
in a 

treatment 3 4 5 6 7 8 9 10 
10 11 10 10 10 10 10 10 10 
4 05 10 10 10 10 10 10 10 10 
O1 — —— — — — — == = 
10 18 18 17 if 17/ ile 124 ike 
5 05 17 17 16 16 16 16 16 
O01 15 15 15 —— == == — 
10 27 26 26 26 26 25 DAS, 25 
6 05 26 25 DR, 25 25 24 24 24 
O1 23 22 22 22 22 21 21 21 
10 37 37 Barf 36 36 36 36 36 
7 05 36 35 35 35 34 34 34 34 
01 32 22 31 ul 30 30 30 30 
10 50 49 49 49 48 48 48 48 
8 .05 48 47 47 46 46 46 46 45 
.O1 43 42 42 41 Al 41 Al 40 
10 65 64 63 63 62 62 62 62 
9 05 62 61 60 60 60 59 59 59 
01 56 55 54 54 53 53 53} 52 
.10 81 80 79 79 78 78 78 77 
10 .05 77 76 76 75 LS 74 74 74. 
O1 70 69 68 68 67 67 67 66 


On the basis that treatments declared significantly different are 
indeed so, we proceed to test treatments M, Sb, C and Sf using k= 4 
and LZ and M using k = 2. The final result is: 


H ih. Sb eC es) 


This result is the same as that obtained using the Wilcoxon-Mann- 
Whitney test. 


Using Duncan’s (parametric) new multiple range test, we find: 


H L Sb M Sf C 
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Conclusions from Duncan’s test and its rank sum analogue differ 
only in that the parametric test finds Sb versus C to be significant. 

The multiple rank sum test can be adapted to apply to testing 
treatments versus control as well. 


5. APPENDIX—THEORY 


The problem is concerned with pairwise testing of treatments in the 
one-way classification or completely random design. 

The test criteria are rank sums, computed as for Wilcoxon’s [21] 
two sample test, for appropriate pairs of treatments. Rank sums will 
be referred to Tables 2 or 3 for testing rather than to White’s [20] table 
for the Wilcoxon-Mann-Whitney test. 

Two tests will be considered: 

1. The all pairs test, in detail. 

2. Treatments against control, rather briefly. 

For the all pairs test, let X; , 7 = 1, --- , k be random variables 
measuring some characteristic for each of k samples or treatments. 
Let there be n; observations on the 7-th treatment. Computation of 
the test criterion requires us to: 

1. Rank the X,’s and the X;,’s, all 7 < j, assigning rank 1 to the 

least observation. 

2. Add ranks for the variable with fewer observations to give T’;; . 

(There is no loss of generality if we assume n, < nm» < +--+ <n). 

3. Compute the conjugate of 7; , namely T/;=(n,+n,;+1)n;—T;; . 

The conjugate is the rank total that would be obtained if rank 1 
were assigned to the highest observation. Conjugates are required for 
two-tailed tests. 


eo ine : k 
Consider (fin, --- , Tux, Tos ; -** , Ty-1,x), & criterion with (*) 


components. Rank tests are based on the assumption that, under Hy , 
all permutations of the >> n; observations are equally likely. Hence, we 
must know the number of ways in which (7, , --- , T',-1,,) can be 
obtained. For this, a recursion formula is given in [14]. This provides a 
method, though tedious, of deriving the distribution of (T12, +--+ , Tx-1,x)- 

The distribution of (7,2. , «+: , Tx-1,x) has been shown to have the 


following parameters [14]: 
EQ 3) =e = in; +n, 1)/2, 
EQ; — bi) = oi; = nnn, +n, + 1/12, 
E(T yi; — was)(Dng — Bai) = Cring = MNMN;/12 = on;,4; , 
ET: — pad (Tis — wes) = Onee3 = nn, /12, 
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ET,» — Mon (Ts; = Mis) ae OR ye 0, 
Dine = pin,ir = ee = nin;/(Mn Sle 1)(m, ft iD 
pa = 0. 


The determinant of the variance-covariance matrix is: 
k 
TaD + v2”. 


The elements in the inverse of the variance-covariance matrix are: 
Corresponding to the variance of 7; , 


12.0 nat 1 —n, —2)/nn{ Li ta + 1). 
Corresponding to a covariance with the T’s having a common 
subscript, h, in the same position: 


—12/ni( >> ne + 1). 


Corresponding to a covariance with the 7’s having a common sub- 

script, 7, in different positions, for example 7, and 7’; : 
12 /, oes 

Finally, the element corresponding to o,,,;; is zero. 

The determinant of the variance-covariance matrix may be evaluated 
as follows: from the ith row of the determinant, factor 171.;/12, 
i = 1,--- ,k — 1; from the ({k — 1] + 2)-th row, factor ngn,,;/12, 
i=1,-:-,k — 2; --- ; from the last row, factor m-1m,/12. The 


k 
product of these factors is ET at 12) 


The entries in the determinant which is the other factor may be 
described somewhat crudely as follows. 

The i-th diagonal block, 7 = 1, --- , k — 1, which contains the 
variances and covariances of the 7’;,’s, fixed z and 7 > 2, will be 


res ee Ore eae Ni+e2 gen Nr 
Ni+1 ae Cena Jen Oe Nk 
N+ Ni+2 Seiad ar 


This is a (k — 7) X (k — 2) block. 
The block consisting of the same rows and the first (& — 1) columns is 


0 aad 0 —N, Ny, 0 0 orga 0 
OF AO ep ne, Ab Pw aypse OU) Sera 
07 Acwins0° (nie 010 Otome 

SSS SSS 


[aM SHS te) 
2 — 2 columns k — 7 columns 
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The next block to the right consists of k — 2 columns and is: 


Oe “Gace ce —N, Ny 0 D0 -:: O 

‘Oe 0 Soar Ee On (ne 0 CC 0) 

Cees. AO ne 0 20 aad 7. 
ee a 
2 — 3 columns k — ¢ columns 


The pattern is now clear. 
The first block to the right of the 7-th diagonal block consists of ie 
next (k — [z + 1]) columns and is 


ei eT 
eal 0 0 
0 pe eee 0 
0 OO es res 
The block to the right of this is 
0 (oe ee ee 
—Niss Le —%, 
Wes 0 0 
0 Nise 0 
0 ON ns 


Again, the pattern is clear. 

At the next step in evaluating the determinant, from column 1 sub- 
tract columns ({4—1]+1) through ({k—1]+[k—2]). To column 2, add 
column ({k — 1] + 1) and subtract columns ([k — 1] + [k — 2] + 1) 
through ({k — 1] + [k — 2] + [k — 3]). Ingeneral, to column 
i,t = 1, --: , k — 1, add thez — 1 columns described as those con- 
taining the first column of the diagonal blocks previously set out, and 
subtract the sum of the next (k — [¢ + 1]) columns, that is, the columns 
including the (7 + 1)-st diagonal block. 

The result of the above operations is that the first diagonal block 
has >> n; + 1 as common diagonal element and zeros elsewhere. The 
- other blocks containing the first k — 1 columns are of the form previously 
described but with —n, and n, replaced by — (D> n, + 1) and Dyn; + 1 

respectively. 

At the next and final step, leave rows 1, --- , & — 1 unaltered. 
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Consider the next k — 2 rows which we now call the first set, the next 
k — 3 rows called the second set, and so on, the 7-th set consisting of 
(k — {i + 1)) rows, beginning with row alk = 2) dee tothe 
j-the row of the i-th set, add row 7 and subtract row 7 + j, these rows 
coming from the first k — 1 rows of the determinant. 

The result of these operations is a determinant with Se nm; + 1in 
the first k — 1 principal diagonal positions, ones elsewhere in the princi- 
pal diagonal, and zeros below the principal diagonal. Hence the de- 
terminant is as given. 

That the inverse elements are correctly given may be checked by 
multiplying the matrix by the given inverse. 

For the treatments versus control procedure, let X;,7 = 0,1, --:,k 
be random variables measuring some characteristic of a control and 
k treatments with n; observations in the 7-th sample. 

Computation of the test criterion requires us to: 

1. Rank jointly the X,’s and X,’s, 7 fixed, giving rank 1 to the 

least observation. 

2. Add ranks for the variable with fewer observations, here assumed 

to be the check, to give T; . 

3. Compute the conjugate, 7% . 

Consider (7, , --: , T,). Again, a recursion formula for finding the 
number of permutations which give rise to a specific value of 
(T,, °°: , T;) is given in [13]. 

The distribution of (7; , --- , 7.) has been shown to have the follow- 
ing parameters [13]: 


E(P;) = wi = Nol +7; + 1)/2, 
E(T; — wu)” NoN(Ny +n; + 1)/12, 
ET; — w:)(T; — wi) = oi; = nonn,/12, 
pig = 2N,/(M +1; + 1G +n; + 1); 
It may also be shown that the determinant of the variance-co- 


variance matrix is 


1 (Nolo + y(d Te 1/2. 


The diagonal and off-diagonal elements of the inverse are, respectively, 


12(>> n. + 1» / | main =e dn (> Neg Ts 1) | 


art 


-12/ | nto + (> n; + i) |. 


I 
I 


2 
0; 


and 
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To evaluate the determinant of the variance-covariance matrix, 
factor non;/12 from the 7-th row of the determinant. 
This gives: 


nk ie No tn, +1 No ae Np 
456 ny, Oe ak ee on ee Np 
On Ns “i Ne +n, 1 
Next, subtract the k-th row from the i-th row,?=1,---,k—1. 
We obtain 
k 
ne Il n; | No — 1 0 ae —(no + 1) 
13 OW st ee teh) 
ees Ny “55 nN tn, + 1 


Finally, obtain a new k-th column as the sum of all columns. It is 
then apparent that the given determinant is correct. 

That the given inverse elements are correct is seen by multiplying 
the matrix by the stated inverse. 

From the above information, it is possible to tabulate probabilities 
for the parent distribution and derived distributions of interest. 
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THE ESTIMATION OF REPEATABILITY AND 
HERITABILITY FROM 
RECORDS SUBJECT TO CULLING 


R. N. Curnow! 
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University of Aberdeen, Aberdeen, Scotland? 


1. INTRODUCTION 


In any animal breeding selection programme, estimates of repeat- 
ability and heritability are needed to choose between the various 
selection schemes available and also to ensure that the highest possible 
genetic gains are obtained from the chosen scheme. Estimates of 
repeatability and heritability are generally subject to large sampling 
errors. Therefore, the most efficient methods of estimation should 
be used even if they do involve rather lengthy computations. In this 
paper, the maximum likelihood estimation of repeatability and herit- 
ability from records subject to culling will be considered. The more 
usual regression estimators are often very inefficient compared with 
these maximum likelihood estimators. 

Suppose that we wish to estimate the repeatability of lactation yield 
in a herd of dairy cattle. We shall assume that only first and second 
lactation yields are available and that, if there had been no culling 
(i.e., if all the cows had had second records as well as first records), 
the first and second records would have been normally distributed over 
the herd with means yp; and py, , variances o; and co} and covariance 
between the two records of the same cow oi, . The first and second 
records of cow 7 will be written y;, and y;. respectively. The assumption 
of normality for the distributions will probably be a reasonably good 
approximation unless the herd can be split into groups so that any two 
cows in the same group are much more alike than two cows in different 
groups. These groups may, for example, be groups of daughters of the 
same sire or groups of cows according to the year in which they gave 
their first record or the month in which they calved. Methods are 
available for making allowances for such groupings, but they will be 
assumed absent in the rest of this paper. Very rarely will the culling 

1The work on this paper was undertaken while the author was visiting the United States on a 


Harkness Fellowship of the Commonwealth Fund. 
2Present address: Unit of Biometry, University of Reading. 
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in the herd be sufficiently intense or sufficiently highly correlated with 
future milk yielding capacity to affect seriously the normality of the 
distributions. 

Repeatability is defined as the correlation between two different 
records of the same cow and is therefore 


[ova 12/0102 : 


Since we are considering only first and second records, the question 
of whether the repeatability is the same for all pairs of records will not 
be discussed. The assumption is frequently made that oj = o; and, 
therefore, that repeatability is the same quantity as the regression 
coefficient of second records on first, B21 = o12/o; . The assumption 
is made, for example, in the formula given by Lush [1945] for comparing 
cows with a differing number of records and in the formula given by 
Lerner [1958] for the ratio of the heritability of the mean of n records 
to the heritability of a single record. It was also used by Henderson, 
Kempthorne, Searle and von Krosigk [1959] in their discussion of the 
disentanglement of environmental and genetic trends from records 
subject to culling. When of ¥ o3 , an estimate of 6, is needed for 
prediction purposes but for other purposes estimates of o7 and oj may 
also be required. We shall assume in this paper that «7 = o3 = o°, 
say, and therefore that 8., = p. A logarithmic transformation applied 
to the data may sometimes be useful in satisfying this assumption. 

p is generally estimated by 6, the sample regression coefficient of 
second records on first. There are two reasons for this. First, b is 
very simple to calculate and, second, it is an unbiassed estimator of p 
despite any culling that may have been based on the first records. 
However, when o; = o; and the first records of all cows are available, 
whether or not they have second records, b is not the maximum likeli- 
hood estimator of p. The variance of the second records about their 
regression on the first estimates o7(1 — p’) and the variance of all the 
first records estimates oc”. These two estimators can be combined to 
give an estimator of p. Maximum likelihood makes use of this in- 
formation as well as the information given by b. 

We shall derive the maximum likelihood estimators of p and a’. 
We shall show that the efficiency of b as an estimator of p, relative to 
the maximum likelihood estimator, is fairly low for values of the various 
parameters that may well occur in practice. The bias of the maximum 
likelihood estimator is shown to be small. This suggests that the 
maximum likelihood estimator may be worth calculating. The com- 
putations involve only the solution of a cubic equation. A section is 
devoted to an approximate check of the assumption that o? = o2 . 
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Attention has been confined so far to the estimation of repeat- 
ability. The methods to be discussed could also be applied to the 
estimation of heritability from parent-offspring records. The assump- 
tion o; = o; means that, had there been no selection of parents, the 
variance of the parents and of the offspring would have been equal. 
In heritability studies, the maximum likelihood method makes use of 
information about animals that are not parents of animals which also 
have records. In milk yield studies, this would include information on 
dams that have only male calves. 


2. THE MAXIMUM LIKELIHOOD ESTIMATION OF REPEATABILITY 


All the cows have first lactation records but they do not all have 
second lactation records. We shall assume that the probability that a 
cow has a second record depends on its first record but not on any other 
character correlated with the second record. This rules out culling 
based on information about relatives or on characters such as per- 
centage butter-fat. However, the effect of such culling on the estimates 
will often be very small and could probably be safely ignored. 

Let N cows have a first record and n < N cows have a second record. 
Numbering the cows with a second record 7 = 1, 2, --- , n and the 
cows without a second record7 = n + 1,n + 2, --+ , N, the first records 
can be written 


Yi (P12, aN) 
and the second records 
Yi2 (= Ve eee OE 


The WN first records are normally distributed with mean p, and variance 
o. Because the culling is based only on first records, the distribution 
of a second record y,, , given the first y;, , is independent of the dis- 
tribution of the first record and is normal with mean up, + p(yi1 — m1) 


and variance o°(1 — p’). The likelihood of all the records is therefore 


N 
ys Yar “> bi) 
w=1 
> Gao 2a? 
ym [Yi2 — He — PYa — ba) | 


t 


1 
Dee PG 


and the log likelihood is, apart from a constant, 
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In Lb = 24M no — Fm —p) 


N n 
(Gites p) 2d Chee ly)” = do [Yio = bo pYia — yy 
Zo, ee 
Kempthorne and von Krosigk (Henderson, Kempthorne, Searle and 
von Krosigk [1959]) suggested the use of maximum likelihood to 
estimate repeatability from records subject to culling. They derived 
the log likelihood when the cows were classified into groups and when 
the number of records for each cow was perfectly general. They derived 
very lengthy equations from which the maximum likelihood estimates 
of the parameters could be determined. In this paper, we are studying 
in greater detail the special case when there is no grouping of the cows 
and when there are at most only two records per cow. 
(2.1) can be written 


(2.1) 


Ns’ 


n+wN 5 
lay 5 = GF) ino = WL toe 


_ np’ Bi + 22 — 2p2Zr») 


2o°(1 — p) cS 
= N@ = ts)” _ Ge = be pn — real 
20° 2o0°(1 — p) 
where 
N n 
5 do Yaa g) es Ya 7s) 
———— eas (ee, | 
: N ) 21 a n ; 
fF S Cie Go 2 Gar GY — G2) 
aaa n ) “12 = ez, : 
x . " 
dX ait ws Yar De Yi2 
ica ) Tie = a and Y= =r 
N VW 


The maximum likelihood estimators of », and pu, are clearly given by 
fi = 9 and fle = J — AG. — m)- 
Also 

a(n L) 7 _M@ + N) Zi Ns’ a n(p Di + Zs — 2p2i2) 
O(a ) 20° oe Qo°(1 — p’) 


- . (2.3) 
Bis 0s MYo = Be = pr — w)/ 
= Qg° aie Ioi(1 ar p) 
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and 
oun Pane ine ee pS 
dp a (1 — p) 
nG, a La) [Go aloe ae CH = M1) | 
= r=) (2.4) 
a nplGo sa Me = ets aa md)" 
c(h p.) 


By setting (2.3) and (2.4) equal to zero and substituting the maxi- 
mum likelihood estimators of u, and ps , p satisfies the cubic 


(Ns? a ndi)p° Nie N) Zap 
+ [m+ N)Xi — Ns’? — 216 — a+ NZ. = 0. 


This equation will have one and only one root between —1 and +1 
of the same sign as the simple estimator b = = 21/2; . There may 
be two Se roots between —1 and +1 of opposite sign to b. The 
formula for é° is 


(2.5) 


ro A(2i re) ee ey ee 
on = 

pile) 
When there is no culling, s’> = 2? and 


oA tees 2212 7 
sere ae 


(2.6) 


When oj = co; , the maximum likelihood estimator of the correlation 
coefficient has the arithmetic rather than the geometric mean of the 
two sample variances in the denominator. 

To obtain the asymptotic variance of #, we need the expected values 
of the second-order partial derivatives of In L with respect to 4; , ue , p 
and «*. These can be derived from a knowledge of the expected values 
of s*, 2; and 2,,. 2} depends on the method of culling and so will 
be taken as fixed. For reasons to be given later, we shall calculate the 
expected values when oj # «3. Clearly, H(s’) = (N — 1/N)o?. By 
writing 


Yio = Me + ae ire ta as NL pee, , 
1 


so that ¢;., has a standard normal distribution independent of y,, , 


2 


le bs ra 2 
me = s Ee Ya — ji) + oe V1 a p (€:9:4 = ae) } 
i=1 


and, therefore, for given >} , 
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yd 4 pas 1 
E(z3)-= ee 3a ; pie e 
Similarly, 
E(2,2) aa = 2 . 
Also, with of = 03, EY — wu)? = o°/N, Els — ue — eli — pl = 
o(1 — p’)/n and E{(9, — m) [G2 — He — PG — mi} = 0. Aitercon= 


siderable algebra the asymptotic variance—covariance matrix for A, , 
a, , &° and @ is found to be V, where 


| N ane np. = np 0 _npG — Hs) | 
uy dehy) amg d =) (ip 
= np nN 0 nt we Hs) 
y= o(1—p) a (1 —p) COs 
n+WN n 
0 0 ae a (1 es p ) 
nl Sa NG. — ba) fe np r) 
= Var 1e=*p:) o(l—p) o1—p) 


and 


NE ME iar a ie 
op p)i + 2pol + aq A 


The asymptotic variance of A is 
(N + n)(1 — p°)? ; 
n{(N + n)(1 — p')2i1/o° + 2Np'] 
Writing n/N = S,so that (1 — 8) is the culling intensity, and 
>?/o” = 1/c, where c measures the effect of the culling on the variance 


of the first records and is therefore a possible measure of the efficiency 
of the culling, 


Vip) 


1 @+9a=e 
nfQ+ 90-0) 4 92 
gas 


We shall now derive the approximate bias in # as an estimator of p. 
By substituting 6 = p + 6 in (2.5) and ignoring terms in & and 6°, 


5 = (Ns =n) 0 —(N—n) Zia! + [n+ N)2i—N(s’— 22) o—M+N)2r2 
3(Ns’ —n>,”) p’ —2(N —n) 2120 + [(n + N) 21 —N(s'— 23 )] 
(2.8) 


Va (2.7) 
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Approximating £(6) by taking expected values separately in the numer- 
ator and denominator, 


Cg), b= pon = N) 
Tin(n + NY — po) +o [(n — N) + p'(2nN + N — 3n)] 


To order 1/n, 
a p (i) 
nf CSC — ot ae 26" | 


E(6) = (2.9) 


Therefore, from (2.7), the approximate ratio of the bias to the standard 
deviation is 


Ba) p(1 — 8) 
eve viFst! tS — 9%) + 29th 


1— 
eae Max R= ei VO + 
Vn V1 a : 
If 7 is reasonably large, the bias in # is unlikely to be serious. The bias 


will become relatively more important when estimates of p from different 
sources are pooled. 


3. ESTIMATION OF o? AND o2 WHEN @? # 2 
In the previous section we derived the expected values of s’, 23 
and %,. for given 2; when oj ~ o;. They were 
N caee ad 2 
Nicer? 


lig) = 


E(23) = ee i eres een BY 


E(z,.) = © a } 


Apart from the usual eae differences in the BE important 
only for small n and N, the following estimators of 4 = po/ o; and 
o2(1 — p’) are the maximum likelihood estimators when i 0:5 


64 aa Ns’/(N a 1), 
Bor =b= Dy pap , 
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and 
ow 


o3(1 — p ) = [23 a Dae 


n 
n— 2 
From the expected values of s*, 23 and ip given in the previous 


section these estimators are all unbiassed. The maximum likelihood 
estimator of o3/a; is 


aw ; ‘ 
a De Zs aan Die/ 21 1 
me = Se 2 ; (3. ) 
O71 Di § 


No exact method is available for constructing a confidence interval for 
o2/o? except when n = N, 7.e., when there is no culling (see Curnow 
[1957] for references and for a method fopere’ when the data are 


grouped). The asymptotic variance of (03/0;) could be derived and 
used to provide an approximate confidence interval. This would give 
some indication of the importance of the assumption that oj = Ge. 
A simpler, but much less sensitive, test of whether «3 > o; could be 
based on the fact that the quantity 


(N — 1)noi - So L/D 
N(n — 2)o3(1 — p) Ss / 


which compares a x°-value from the variation of the second records 
about their regression on the first with a x’-value from the variation 
of all the first records, has an F-distribution with n — 2 and N — 1 
degrees of freedom. 


(N — 1)n 23 rae Die 2a 
N(n — 2) s” 


significantly greater than F = 1 would suggest that o3(1 — p’) > oj 
and, therefore, that o3 > o; . 


F= 


4, THE EFFICIENCY OF THE SIMPLE REGRESSION ESTIMATOR 
OF REPEATABILITY 


The statistic b = 2%,./D% is very easy to calculate and is the one 
generally used to estimate repeatability. It is the only possible esti- 
mator if first records are available only for those cows also having second 
records. Providing that culling is based solely on the first lactation 
yields, b is always an unbiassed estimator of 82; = po2/o, , but an un- 
biassed estimator of p only when oc} = o;. In this paper we are assuming 
ao, = o,. The variance of b is 


V(b) = o°/nd} = c/n. (4.1) 
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From (2.7), the asymptotic efficiency of b relative to the maximum 
likelihood estimator, A, is therefore 


(1+ Sa — >.) 
(1 +S) =p) + 2ep? 


The values of this efficiency are shown in Table 1 for various values 
of S, pande. 

The following considerations suggest that c is unlikely to be greater 
than c = 2. Let the N cows be reduced to LN by accidental factors 
uncorrelated with the level of yield. Then a proportion P =n/LN=S/L 
can be selected on the basis of first lactation yield. Assume that the 
selection is of the proportion S/L of the herd having the highest yields 
and that n and LN are sufficiently large that the effect of the selection 


Eff. = 


TABLE 1 


Tue Asymptotic ErricleNcy OF THE SIMPLE REGRESSION ESTIMATOR OF 
REPEATABILITY RELATIVE TO THE Maximum LIKELIHOOD ESTIMATOR 


| 
Overall Selec- C= 07/7 
tion Intensity Repeatability 

(S =n/N) | (p) 1/2 1 4/3 2 4 
1/4 0 1.00 1.00 1.00 1.00 1.00 
1/4 0.95 0.90 0.88 0.82 0.70 
1/2 0.79 0.65 0.58 0.48 0.32 
3/ 0.49 0.33 R27 0.20 0.11 

1 0 0 0 0 0 
1/2 0 1.00 1.00 1.00 1.00 1.00 
iW, 0.96 0.92 0.89 0.85 0.74 
1/2 0.82 0.69 0.63 0.53 0.36 
3/ 0.54 0.37 0.30 0.23 0.13 

1 0 0 0 0 0 
3/4 0 1.00 1.00 1.00 1.00 1.00 
1/4 0.96 0.93 0.91 0.87 0.77 
1/2 0.84 0.72 0.66 0.57 0.40 
3/4 0.58 0.40 0.34 0.25 Ou: 

it 0 0 0 0 0 
1 0 1.00 1.00 1.00 1.00 1.00 
1/4 0.97 0.94 0.92 0.88 0.79 
1/2 0.86 ORT 0.69 0.60 0.48 
3/4 0.61 0.44 0.37 0.28 0.16 

1 0 0 0 0 0 
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can be approximated by the truncation of the top P proportion of an 
infinite normal population with mean sy and variance o. The variance 
of the selected first records will then be 


Sy =o[1 —r»v? —T)], 


where vy = Z/P and Z and T are the ordinate and abscissa at the point 
above which lies a proportion P of the population (Finney [1957] derives 
this formula and provides a table of values of y' = »(v — T) for various 
values of P). The division of the overall selection intensity, S, between 
L and P is never complete and is certainly never known exactly. How- 
ever, P is generally considerably larger than S, 7.e., there is a large 
amount of random culling and not so much actual selection. The 
value of ¢ is 


5 = 1/[L—-»— T)). 
1 
For various P, it takes the following values: 


P-=0.95/20.9.. 2 0.89.00 007 0. 3ae nO 
1:94 1AQ AG72, "2.03 92.37 2 5a) 


I 


c 


In practice, P is unlikely to be less than P = 0.7 and therefore c is 
unlikely to exceed c = 2. Table 1 does include c = 4 to illustrate the 
effect of intense selection based on yield and c = 3 to illustrate the 
effect of a selection scheme that results in an increased rather than a 
decreased variance. The efficiency is the same for p as for —p and so p 
is shown as positive in the table. In practice, p is very unlikely to be 
negative. 

When p = 0, the efficiency of the simple regression estimator is 1. 
When p = 1, it is 0. For intermediate values of p, the efficiency in- 
creases rather slowly with S for fixed c, but decreases fairly rapidly 
with increasing c. For all values of c and S, the efficiency is very low for 
high values of p. As an example of the efficiency likely to occur in the 
estimation of the repeatability of lactation records, the efficiency is 
0.66 when c = 3, p = 4 and S = ?. In this case, the maximum likeli- 
hood estimator would certainly be worth calculating. 


5. THE ESTIMATION OF REPEATABILITY BY AN ANALYSIS 
OF VARIANCE 


Sometimes the effect of culling based on the first records is ignored 
and p estimated from a least squares analysis of variance of the N + n 
first and second records. Table 2 shows this analysis of variance. 
The “cows and periods” sum of squares has been split into both “cows 


REPEATABILITY AND HERITABILITY 563 


adjusting for periods and periods ignoring cows” and “cows ignoring 
periods and periods adjusting for cows’. By a period difference is 
meant an average difference between first and second lactation records. 
The expected values of four important mean squares are shown. A, , 
A, and A; are defined at the foot of the table and vanish only if the 
culling is random with respect to the first records. Two cases need to be 
distinguished. In the first we assume yn, = yu, and in the second we do 
not. We shall use the reference numbers at the right of the table. If 
My = M2 , Mean square 2 and a mean square obtained by pooling sums 
of squares 3 and 4 are used to estimate o” and p. This is exactly equiva- 
lent to an analysis of a one way classification with unequal sub-class 
numbers (Snedecor, [1956], §10.16). If u, # pw, , mean squares 1 and 4 
are used to estimate o and p. There seems to be little justification 
for using mean square 2 with mean square 4. One or other is biassed 
or inefficient according as uw; ~ pe Or fy, = Me. 

The bias in any of the above methods is difficult to determine without 
some knowledge of the effect of culling on the expected values of 
2 == 1)/n]o”, T= $5 Gh Hi) —o (hand Gi— i)y — 2) N. 
However, these unknown biasses are clearly undesirable. Since the 
method of estimation described in this paper is the maximum likelihood 
method whether or not the culling is random with respect to first 
records, it is to be preferred. If culling is random with respect to first 
records, c can be substituted as c = 1 in the formulae for the asymptotic 
bias and variance of the maximum likelihood estimator and the asymp- 
totic efficiency of the regression method. 

Wadell [1961] has given a method for estimating p when the culling 
is equivalent to truncation of the first lactation yields. His estimate 
of p is a function of 9%; , J , s’, D{ and the mean squares 2, 3, and 4 of 
Table 2. The estimate has a negligible bias for large values of n but 
its variance is not given and therefore its efficiency is unknown. 


6. A NUMERICAL EXAMPLE 


Dr. A. E. Freeman of the Department of Animal Husbandry, Iowa 
State University has kindly made available to me some Complete Herd 
Improvement Registry Records of an Iowa Board of Control herd of 
Holstein-Friesian cattle. The records included some first and second 
lactation yields made in various years but expressed as deviations from 
the herd average for each particular year. The cows were milked 
twice daily and their yields expressed as mature equivalent 305-day 
lactation yields. 

The values of the various relevant statistics for a sample of the 
records were: 
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TABLE 2 
ANALYSIS OF VARIANCE OF First aND SEconD RECORDS 
, ‘ Ref. 
Source d.f Sums of Squares Expected Values of Mean Squares sh 
2 n+N—2 2 
Cows adj nN <2 CoC gee a Nee 
ee N- 1 = (Eo + 2D. — Bi) + Ns i 
Periods 2 ~ ' 
+oN—p ¥ + 2p — 1) A, 
Periods nN ee 
ign. Cows z N +7 G — 92) 
Nis gee eres ee 
Cows and N 9 + 2242 z;) 4 Ns 
Periods aN 
a0 N+n G a 92) 
Gap) 
,} Ee (n + N)? — (N + 3n) ue 
Cows ign 5 (33 + 22. — Bi) + Ns (N — I(N +n) 
eae N-1 a 3 2 
Periods aN P n(N — n) bi us)? 
a Te Ee a ON? O(N — 1)(N ; 2 1 
a N+ (@ — G9) 2 (GJ. — Ge) 2(N IW +n) 
aF Waal As 
Periods é n : ; 
adj. ee 1 5G: =p) on —"p)iat * (us = th) 4-21 — p) As 3 
ss P ez 
Error aol ; 22 — 222 + 2 a(1—p)+ aa Ay 4 
AUOIUNL, WP top =o i | pe Se We Se ak (g — 9)" 
Adj. for 1 (Ny + ng) 
mean N+n7 
A, < >? — RS a ° 
: n 2 * 


1 1/N -— Bi 2 a 
Oa (p + 2p — 1) Ay + (: ao + 2p — 1)(D* —o /n) 


N—n 
ne (x cin 


As = 3(1 — p)(D? — a /n) = (te — ;) D and D=%y— 1. 


= 220, 
Sb 
= 286.55, 


2 


N+n7 


n = 150, 
qi = 1.147, Yr 
a= 270.515 


S = 0.682; 


INipaane 2/47 
= fk ae pu] D= ae (Dy — « /N) 


—0.207; 
Dio = 125.85 and 3 = 354.98. 


aN 


The maximum likelihood estimator of o3/o7 (8.1) is (63/07) = 1.25. 
This value is almost certainly not significantly greater than o3/o, = 1. 
However, it is sufficiently large to raise some doubts about the assump- 


tion that o} 


= o,. The estimate of o3(1 — p’) is greater, but not 
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significantly greater, than the estimate of o;. However, for illustrative 
purposes we shall assume o3 = o? . 
Equation (2.5) for # is 


p — 0.39224 + 0.12576 — 2.0728 = 0. 
The simple regression estimator of p is 
b = Sis/ ls — 0.465. 


Three cycles of an iteration based on formulae (2.8) and with ~, = b 
as the initial solution show that the value of / is, to three decimal 
places, 

p = 0.404. 


This is the only real root of the maximum likelihood equation for #. 
The two estimators together with their estimated standard errors 
[(2.7) and (4.1)] are therefore: 


b = 0.465 + 0.084 


and 
0.404 + 0.069. 


I| 


p 
The latter standard error is an asymptotic standard error. The esti- 
mated efficiency of the regression estimator is only 67 percent and so 
p is probably well worth calculating. The estimated asymptotic bias 
(2.9) is very small, 
aa 
E(6) = 0.0004. 

The maximum likelihood estimator of o” (2.6) is 6? = 314.46. This 
agrees well with the value s* = 286.55 used above in estimating E(6) 
and the standard errors of b and #. 

The estimated value of ¢ is 


Ci 2 O50: 
This value of ¢ suggests that there has been very little selection based 


on first lactation yield in this particular sample of records. The 
standardized selection differential 


(j, — 9)/s = 0.134 
suggests that the effective selection intensity P was near 0.93. 
7. SUMMARY 


The maximum likelihood estimation of repeatability from first and 
second lactation records of a herd subject to culling is discussed. The 
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method is applicable only when it can be assumed that, had there been 
no culling, the variances of the first and the second lactation records 
would have been equal, and when all the first records are available 
whether or not the cow has a second record. The efficiency of the more 
usual estimate of repeatability, based on the regression of second records 
on first records, is shown to be low when the repeatability is high. In 
many cases the calculation of the maximum likelihood estimate would 
seem to be well worthwhile. The use of the method in estimating 
heritability is mentioned. An illustrative example is given. 
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THREE CLASSES OF UNIVARIATE DISCRETE 
DISTRIBUTIONS 


C. G. Kuarri ann I. R. Parent! 
M.S. University of Baroda, Baroda, India. 


1. INTRODUCTION 


Families of descrete distributions have been developed and studied 
by many authors, including, Neyman [1939], Feller [1943], Skellam 
[1952], Beall and Rescia [1953] and Gurland [1957, 1958]. These 
families are of three types: 


Type A: gu) = exp {h@)}, 
Type B: gas(z) = {h@)}’, 
Type C: gc(z) = ¢ log {h@}, 


where g(z) represents a probability generating function (p.g.f.) and 
h(z) is a p.g.f., except possibly for additive and multiplicative constants. 
The aim of this paper is to set up formulae for certain statistics for 
these types. It is hoped that these will be of use to reseach workers in 
practical fields, who will be formulating compound and generalised 
distributions of these types by using specific forms of h(z). 


2. RECURRENCE RELATIONS FOR PROBABILITIES 


Notations: Let the r-th derivation of f(z) be denoted as f(z). 
Also let 


fy = {g(@)/r}} rant (1) 
and 


I 


fh (2)/r!} leno. - (2) 


By the definition of g(z), it is clear that P, denotes the probability 
of the r-th count in g(z) and z, , the probability of the r-th count, 
excepting possibly for additive and multiplicative constants. 

Type A: Here 


T, 


ga(z) = exp {h(@)}. 


1 Present address: Statistical Officer (I), Rajkot, India. 
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Successive differentiation leads to 
(r) ae . [Baa NE (r—k) (3 
ga @) = 3 eee h'(@)ga (2). ) 
k=1 \ 


Hence, on letting z = 0, we have 
Pe Ss kr,P,-,/r with Po = exp (mo). (4) 
k=1 


Type B: Here 
gu(2) = {hlz)}". 


Successive differentiation of h(z)gQ@(z) = ngs(z)h'’(z) leads to 


o (7 or@gr@ =n D(77 thr@asro. © 


k=1 k =" 1 


Hence, on letting z = 0, we have 


r 


P, = > (nk —r4+ k)n,P,-./rm with Po =m. (6) 


k=1 
Type C: Here 
gc(2) = c log {h(z)} where ec = log {h(1)}’. 


Successive differentiation of h(z)gS?(z) = ch“ (z) leads to 


pil 
nagr@ = ar@ — S(" > \n@ae-Pe. (7) 
k=1 
Hence, on letting zg = 0, we have 
Tae 
P, = {rer, — >< (r — k)n,P,-x}/rm with Py) = clog. (8) 
k=1 


3. FACTORIAL CUMULANTS 

Notations: Let 
iy ne ale) ile (9) 
Min = {g°(@)} z=l ) (10) 


and 


Kin = {(d/dz)" log g@} |. . (11) 


From the definition of factorial cumulants, it is clear that y/,, is the 
r-th factorial moment of h(z) if h(z) is a p.g.f. and M‘,, and K,,, are 
respectively r-th factorial moment and r-th factorial cumulant of g(z).. 
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Type A: Here log {g4(z)} = A(z) and hence, 
y= Hirt: (12) 
Type B: Here 


(2) = log ga(z) = n log h@). 


Hence, on using (11), it is clear that r-th factorial cumulant of gx (2) 
is n times the r-th factorial cumulant of h(z) if h(z) is a p.gf. Also on 
using (7) (with necessary modifications), (9) and (11), we have 


itt 
a p= il 
Kr = Min = ali if \obiRin : (13) 

Type C: Here, it is easy to give a recurrence relation for factorial 
moments JJ{,, rather than factorial cumulants. By using (7), (9) and 
(10), this relation can be shown to be 


: , Ta ae, 1 
Mi. = on a » i: VatnnMEt anf (14) 


k=1 


where ui = A(1). The factorial cumulants can be obtained from (14) 
by using the relations 


Kuy = Mi ) K ja) = Mie, = Mi) ) 
Kas; = Mis; — 3M1.,;Mi, + 2MP , ete. 
4, SPECIAL CASES 

Notation: If a variate has either a Binomial or a Negative Binomial 
law as a special case, we say that it has a general binomial law. 

Type A: From the form of the Poisson-Binomial, Negative-Bi- 
nomial, generalised Polya Aeppli, Beall and Rescia [1953] and Neyman’s 
Types A, B, and C, it is clear that they belong to this form. It is to be 
noted, that, if in the classical problem of egg masses and larvae, the 
egg masses have a Poisson distribution with p.g.f. exp [A(z — 1)] and the 
’ larvae within an egg mass, a distribution with p.g.f. w(z), then the 
distribution of the larvae over the whole field is exp [A{w(z) — 1}] 
which is of Type A with h(z) = A{w(z) — 1}. 

Some important distributions with their recurrence relations are: 

(i) Poisson-Hypergeometric distribution [1958]: Here 


wz) a ys (@zenxe = 1)’/(a ss B) (ry (15) 
where 
k (r) 
Ce ala + ean amo — 1), (") ee ee RD 
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Eb? = kk -— 1) k= + DY, eo ks) 1 sand eg hyeat 


and k are such that w‘” (0)/r! = 7, is positive. The above distribution 
was first given by Gurland [1958]. The recurrence relation in prob- 
abilities is 


Po=X De, Mate: (16) 
s=1 


where P, = exp {A(m — 1)}, and 
atBtr—2—me—k+2r — 3) 
aa r(1 — m) ys 
_ eer DGa ir ae 
rr — 1) — m) hie 


5 d 
De a¢s)(—k) ay m'*/sla + B) cs) and 7, = =e 
s=0 m 


(17) 


FS 
| 


For the particular cases of the above distribution, we may refer to 
Gurland [1958]. The recurrence relation for factorial cumulants is 


Lees hack? m"/(a + B) ce (18) 
(ii) Poisson-Power series distribution: 
we) = Diaz’, (19) 
i=0 


where a,’s are constants such that w(z) is convergent for some z. The 
recurrence relation for probabilities is 


P, = >5sa,P,-./r with Po = exp {X(ao — LD}. (20) 
s=1 


The above distribution was first given by Maritz [1952]. 

Type B: Let the distribution of egg masses be a general binomial 
with p.gf. (1 — p + pz)”. Then the distribution of the larvae over 
the whole field is [1 — p + pw(z)]", ie. the Type B distribution with 
h(z) = {1 — p + pw(z)}. When n = 1, this leads to the distribution 
with p.g.f. w(z) with an addition (or subtraction) of zeros. Ifn > 1 
or n < 0, this can be regarded as n-th confluent of w(z). 

Some important distributions with their recurrence relations are: 

(i) G. Binomial-Hypergeometric distribution: Here w(z) is the same 
as given in (15). This was first stated by Gurland [1958] in the special 
case when n < 0, p < 0. The recurrence relation in probabilities is 


=? dX (ns — r + 8)r,P,-./100 , (21) 
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where Py) = a , G = 1 — p + pm and 7z,’s are defined in (17). 

(ii) G. Binomial-G. Binomial distribution: Here w(z) =(1—m-+mz)*. 
Hence, 7; = m(k — i + 1)2;-,/i(1 — m), with mm = (1 — m)*, 
a =1—p+prand Py = a}. The recurrence formula has the same 
form as in (21). 

(iii) G. Binomial-Poisson distribution: Here w(z) = exp {A(z — 1)}. 
Hence, 7; = \'tm , mT = exp (—A), & = 1—p+ pmand Py = a. 
The recurrence formula has the same form as in (21). When n = 1 and 
p{l — exp (—A)} = 4, this was called by A. C. Cohen [1960] an ex- 
tension of a truncated Poisson distribution. 

Type C: Let the distribution of egg masses be a logarithmic dis- 
tribution with p.g.f. log (1 — \z)/log (1 — 4). Then the distribution of a 
larva over the whole field will be log {1 — dw(z)}/log (1 — A), ie. the 
Type C distribution with h(z) = 1 — Aw(z). The important distribu- 
tions are obtained by considering w(z) as hypergeometric function 
G. Binomial, Power-series etc. 


Choice of a distribution under the condition of no migration: 

Let us suppose that the different sites of a colony are distinct and 
countable, and let there be no migration between sites of a colony. 
Assuming the same probabilities of arriving at a particular site by any 
organism, the distribution of r (when r is fixed) organisms in a particular 
site is Binomial. Now let us suppose that one or more organisms 
arriving at the colony follow the truncated Negative Binomial law. 
(This may be true under the wide applicability of the Negative Binomial 
in biological data, e.g., Bliss [1953], Evans [1953]). Then, it is easy to 
show that the p.g.f. of the organisms in a particular site (when there is 
no migration) is 


1— 6+ of + m — mz) ** =o Nine m) *}/{1 cot m) **} 


hee ee en Ukr Oe: loop Fpl. rete mz) ** 
forr0 <p < {1 — (1 +m) "}",m > Oand kh, > 0. Now suppose 
that the independent results of n sites are combined together. Then 
the p.g.f. of the distribution of organisms is 


eS ade Di mz) **)" 


which is a particular case of G. Binomial-G. Binomial. The above 
distribution can be named as Binomial-Negative Binomial. 


5. METHODS OF ESTIMATION FOR G. BINOMIAL-G. BINOMIAL 


Here the p.gf. is [1 — p + p(l — m+ mz)*]’. 
Method of moments: Let T = Kahin) ee Besa) py and 
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S = Ki,,)/Ki2;. The approximate value of n is obtained from 

n(TS — 2k? + R) + nS(TS — 6R + 6) + S(R — 2) = 0. 
Then k, m and p are estimated from 

k= {8 +nS +72 — R)}/n{nd — R) — 8}, 
(nK 2; + Kiy)/(k — 1)nKy, and p = Kyj/knm. (22) 


I 


m 


Method of maximum likelihood when n and k are fixed: 


Similar to Sprott’s results [1958], we have the maximum-likelihood 
equations as 


fF =nkpm and L(m) = 2a,G(r) — N =0 (23) 


with L’(m) = (d/dm)L(h) = DS a,G(r)[m"{1 — kd — py} —- 
{1 + (1 — m)[mk(1 — p)]"h-*FA G(r)}], where 7 is the sample mean, 
a, is the frequency at the 7r-th count, N is the total frequency, 
G(r) = (r + UP,.,/P,7, A G(r) = Grr + 1) — G(r) and m, # are 
maximum likelihood estimates. 

From the first approximate estimates 9’, ’, the new corrected 
values p”’, 7’ are estimated from 


Te = th! = LG Ei) an 2 at ene. (24) 


Sample zero frequency when n and k are fixed: 


Here the estimates p and m are obtained from 
a = Nil —p+ pil — m)"]" and F = nkpm. (25) 


It may be noted that when n = 1, the two equations in (25) are the 
same as those in (23). 


6. EXAMPLE 


In order to illustrate how the above discussion can help an experi- 
menter in the field of curve fitting, we fit here Binomial-Negative Bi- 
nomial to the data in Distribution 1 of MacGuire et al. [1957, Appendix]. 
From the data, the first four factorial cumulants are K,,, = 2.5900156, 
K(2; = 0.6877630, K,3; = 0.0218497, Ky4; = 0.9080044. 

Hence, on taking n = 1, the solution of k, by the method of moments, 
correct to first decimal place is —12.0. Then the various estimates 
of mand p are m = —0.2204963, p = 0.9788582 by the method of zero- 
cell frequency or maximum-likelihood and m = —0.2196584, p = 
0.982591 by the method of moments. 
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The fits are shown in Table 1. The fits of other distributions are 
given along side for reference purposes only. 


TABLE 1. 

Binomial-Neg. Binomial 
Count Obs. Negative | Poisson; | Poisson 
per fre- by M.L. or | by method | Binomial | Binomial | Power 
plot quency zero fr. of moments [1957] [1957] Series® 
0 355 355.000 346.445 324.30 341.84 | 339.072 
1 600 622.478 628.153 660.37 644.37 | 645.036 
2 781 730.994 735.340 734.06 728.03 | 730.150 
3 567 616.306 618 .023 610.45 609.14 | 610.886 
4 441 417.545 417.393 408 .82 415.60 416.079 
5 245 241 .296 240.550 236 .54 242.72 | 242.339 
6 135 123 .567 122.747 122.49 U2 | IA Bw 
‘aaa 42 57 .406 56.846 58.12 58 .20 57.655 
8 17 24 .632 24.315 25.68 24.76 24.417 
9 dal 9.395 9.731 10.70 9.75 9.567 
10? ill! Geo00 5.457 4.47 5.42 5.267 
x? with 19.184 20.294 34.52 DAS (52 25 .939 
y df. (fh olat A tela. 8.df. 8.d.f Smdets 


2Expected frequencies are from 10 and above 
3Probabilities are calculated from the p.g.f. exp { —(a +b) +az + bz} where a = 1.9022526 and 
6 = 0.3438815. 


7. ASYMPTOTIC EFFICIENCIES 


Here we give the asymptotic efficiencies for the various methods 
of estimation for m and p only ina G. Binomial-G. Binomial distribution. 

The determinant of the information matrix up to order N ‘ for the 
maximum likelihood estimates m and p is 


Dap = N’n?'[npk{k(1 — p) + (1 — mm YR 
eee) i / (lene 


where R = —1 + >> G(r)P, and G(r) = (r + 1)P,41/nkpmP, . 
The determinant of the covariance matrix up to order N * for the 
moment estimates of m and p is 


D = (K,K, + 2K — K;)/[Nk(k — 1)nmK,]’, (27) 


(26) 


where 


Ke= ik iow Bp, K.,/K, = 1+ (k — lm — (K/n), 
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K,/K, = 1+3(k —Dm+& -— Dk 2)m? — 3(K./n) — (Ki/n’) 
and 
K,/K, =1+ 7k — m+ 6% — DR — 2)m’ 
+ (k — 1)(k — 2)(k — 3)m* — 4(K3/n) — 6(K,K./n’) 
— (Ki/n*) — 3(K2/nK,). 
The asymptotic efficiency in the restricted sense is 
EB, = 1/D Die (28) 


The determinant of the covariance matrix up to order NV ~? for the 
sample zero frequency estimates of m and p is 
Dim, p) = mad{ai(1 — Po) — PoK:}/N*amn’K,Po , (29) 
where Gp = 1 = p + pl = mm), Ge = Ke/ Ka ey eae 
a = 1— (1 — m)* — km(1 — m)*"*. The asymptotic efficiency with 
respect to the method of moments is 
E, = D/D(m, p). (30) 
It may be noted that H, = Ey’ when n = 1. 


TABLE 2 
Asymptotic Erricrency 2, ReLative To THE MetuHop or MomENTS 
WHEN n = lanpk = —1. 
m 

p{l —(1 — m)*} = 0 —0.5 -1 —2 
0.3 1.492 1.857 2.193 
0.6 1 Cail Del25 2.630 
0.9 2.444 4.000 4.944 
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INTRODUCTION 


In an earlier paper [1] we challenged the prevalent view that in 
comparisons of pain relieving drugs it is always desirable to have ‘each 
patient act as his own control,” i.e., to test more than one drug on each 
patient and to estimate treatment contrasts from within-patient differ- 
ences. We sought data of other investigators to compare with our own, 
but we were unable to find references which gave sufficient detail to 
permit investigation of the merits of alternative designs and analyses. 
It seemed desirable, therefore, to put our own data on record so as to 
facilitate discussion of the issues raised. This we did, and the response 
has been gratifying. 

This paper presents and discusses some of the suggestions and 
comments of those who wrote to us. We are particularly indebted to 
Frederick Mosteller, Department of Statistics, Harvard University, and 
to Robert Curnow, A. R. C. Unit of Statistics, University of Aberdeen, 
Aberdeen, Scotland, who, in addition to making numerous thoughtful 
comments, performed their own analyses of our data. 


A SIMPLE EXTENSION OF THE MODEL 


As stated in our earlier paper, this study of relief of post-operative 
pain was designed as an incomplete block experiment. Each patient 
was given a dose of a test drug, and the dose was repeated when the 
patient again complained of severe pain. The number of hours of 
“oreater than 50 percent pain relief’? cumulated over both intervals 
was the measure of drug efficacy. A second drug was tested on each 
patient in the same way, starting at the time when the patient again 
complained of severe pain. Three drugs, two levels of a new drug and 
one level of Demerol, were under study. 
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It was found that the total hours of relief achieved when a given 
drug was the second administered was larger on the average than when 
that drug was administered first. This is consistent with the presump- 
tion that the pain decreases with time after operation. However, it 
invalidates a straightforward least squares analysis based on the model 


hours of relief = grand mean + patient effect 
+ drug effect + random error. 


We considered the possibility of extending the analysis to include a 
simple time-period effect, representing the average differences in relief 
between the second and first periods, but for two reasons we did not 
pursue it. Firstly, and most importantly, it seemed to us that such a 
model would be unlikely to be an accurate reflection of the true situation. 
For example, if the duration of drug effect increases with time after 
operation, the period effect should be greater when the second drug is 
given after a potent drug than when it is given after a weak drug. 
Secondly, our object in the first paper was to compare simple alter- 
natives for the design and analysis of studies to evaluate analgesics, 
and a procedure requiring adjustment for time period as well as patient 
effects did not seem attractive as a procedure to recommend to 
clinicians for routine screening of new drugs. (In this connection one 
should remember that it is almost always necessary to discontinue study 
of some patients, often for reasons unrelated to drug response. Thus, 
balance in design is hardly ever achieved.) 

Curnow extended the least squares analysis of our data to take 
account of such a time-period effect. The model becomes 


hours of relief = grand mean + patient effect + drug effect 
+ time pertod effect + random error 


and the corresponding analysis is given in Table I. He stated that the 
differences due to the drug which follows a potent one being administered 
rather later than one which follows a weak one were small and had little 
effect on the estimates of drug differences. Thus the internal evidence 
of the experiment itself does not seem to confirm the fears expressed 
above, and the least squares analysis that includes a time-period effect 
appears to yield a reasonable description of the data. In particular, 
in both the inter- and intra-block analyses, the between-group residual 
mean square is now not larger than the corresponding within-groups 
mean square. 
Curnow went on to point out that the inter-block error is not much 
larger than the intra-block error, and that one might on this ground 
justify an analysis zgnoring patient differences, as shown in Table II. 
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TABLE I 
Least Squares InTRA-BLOcK ANALYSIS 
Source 1D. 8.8. MS. 
Blocks 42 607 .256 
Periods and Drugs 2) 378.015 
Periods ignoring drugs 1 261.628 
Drugs adjusted for periods 2 116.387 58.194 
Between Groups Residual 3 3.051 1.017 
Within Groups Residual 37 428 .945 11.593 
Total 85 1417 .256 
DrucG CoMPARISONS 
Comparison Mean Difference Variance t 
D vs. Ti 3.421 1.178 Bed US 
D vs. Ts 2.037 1.061 1.98 
T3 vs. Ti 1.384 1.025 1235 
TABLE II 
Least Squares ANALYsIS IGNORING PATIENT DIFFERENCES 
Source D.F. S.S. M.S. 
Periods and Drugs 3 434.368 
Periods ignoring drugs Ht 261.588 
Drugs adjusted for periods 2 172.780 86 .390 
Remainder 82 982 .888 11.986 
Total 85 1417 .256 
Drug CoMPARISONS 
Comparison Mean Difference Variance t 
D vs. T; 3.548 0.874 3.80 
D vs. T3 1.861 0.831 2.04 
T3 vs. T1 1.687 0.816 1.87 
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This observation suggests that in future experiments conducted under 
similar conditions we should not arrange matters to eliminate patient 
effects, if it will cost us much to do so. 

Our own calculations and scatter diagrams confirm Curnow’s re- 
sults. However, in view of our first argument against the use of a 
simple time-period effect we found them surprising and made some 
further investigation. We take note first that the fact that a model 
“fits”, in the sense that certain internal checks are satisfied, does not 
insure its correctness. If, for example, second-dose relief were more 
variable than first-dose relief, an examination of residuals would not in 
this case establish that fact; the estimated first-dose residual for a 
given patient has precisely the same magnitude as the estimated second- 
dose residual for that patient when we use the present model. The 
second-dose relief scores are, in fact, more variable than are those for 
first doses. This may be observed by examination of Table 1b in [1] 
and it is reflected in the fact that the within patient residual variance 
(Table 3a), which excludes the component of variability due to patient 
differences, is actually larger than the between patient variance for first 
dose relief (Table 2). 

If the facts are in accord with the assumption that the increment 
in relief with increasing time after administration is proportional to time 
after operation, the effect of the model having only a simple time-period 
effect would be to assign a part of the drug difference to the time-period 
effect and thus to reduce the measured differences between drugs, as 
compared to an analysis for the first period only. In fact, the difference 
between the most and least potent drugs (D and T,) estimated by use 
of the extended model is less by about 15 percent than the estimate 
obtained from analysis of the first period only. 

Granted, then, that the model with a simple time-period effect 
fits the data insofar as intra-block checks are concerned, what may we 
expect to gain or lose if we go ahead and use it? We do have some 
evidence which tends to contradict the assumptions of this model, but 
the effects do not seem large. We have some slight evidence that we 
may lose discriminating power by misinterpreting drug differences as 
time-period effects, but, as judged from these data, the reduction in 
variance may well compensate. Insofar as testing (i.e., deciding which 
drug is better) is concerned, or estimating potency in an experiment 
with a full range of standards, the analysis based on this model should 

give essentially valid results. 

In any event, apart from the question of the best analysis for these 
data, the smallness of the between patients component in the extended 
analysis suggests that we would have achieved at least equal precision 
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for drug comparisons if we had kept each patient on a single drug 
throughout the experimental period. Had we done so, we would have 
had, in addition to possible benefits in precision, the comfort of closely 
imitating the ordinary clinical situation, and the advantage of an un- 
biased estimate of a clearly interpretable measure of effectiveness along 
with an unbiased estimate of its variance. We might also have had the 
cooperation of several more surgeons, some of whom, understandably, 
object to studies which require administration of several coded drugs 
to each patient. 


A REGRESSION MODEL 


In furtherance of the point of view that the cost of statistical work 
is generally small compared to the cost of gathering clinical data, 
Mosteller suggested that the data be analyzed in accordance with a 
model which might give a fairly realistic appraisal of the time-trend 
effect. In particular, he suggested that we not cumulate the two doses 
of each drug but analyze the data for single doses according to the 
following model. 


Yima Sa OX + (b, = Voalina Sp Eimd ) 


where 
a; = effect of 2-th subject, 
6, = effect of medication m when given at time zero, 
Ym = regression of effect of medication m on time of administration, 
tina = time at which individual 7 receives the d-th dose of medica- 


tion m, 
€im¢ = Yandom error. 


This analysis was carried out, using the times of drug administration 
(not shown, but available upon request) in addition to the data already 
presented. It was found that the regression coefficients y,, were quite 
close to one another and, if we restrict the model to the case of equal 
Ym, the least squares estimate of this common value is 0.20, correspond- 
ing to an additional hour of drug relief for every five hour interval 
between operation and drug administration. Using this restricted model 
and doubling the estimated effects to make these results comparable 
to those for our other analyses, we have the estimates of treatment 
differences shown in Table ILI. 

Assuming for now the correctness of the regression model, we see 
that these estimates are quite close to those obtained in the analysis 
using only the first drug for each patient—Table 2 of [1]. However, 
the variances are reduced by about one-third, so that the regression 
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TABLE III 
Druc Comparisons From Regression Mope. ANALYSIS 
Comparison Mean Difference Variance t 
IBY apse, UE%, 4.066 0.824 4.48 
D vs. T; 2.530 0.732 2.96 
T3 vs. Ty Sa: 0.708 1.82 


analysis, using all the data, appears to have about 50 percent greater 
precision than the “first drug only” analysis, which uses only half 
the data. 

Comparing the apparent precisions provided by the above analyses, 
we see that Curnow’s intra-block analysis, adjusting for patients and - 
time periods (Table I) appears comparable in precision to the “first 
drug only” analysis. Curnow’s analysis ignoring patient effects (Table 
II) appears almost equal in precision to the analysis based on the 
regression model. 


OTHER COMMENTS AND ERRATA 


A third correspondent, Irwin Bross, Roswell Park Memorial Insti- 
tute, Roswell Park, New York, raised several points, in part overlapping 
those above. In addition he pointed out that the results of the “least 
squares” analysis (Table 4a in [1]) differ appreciably from the analysis 
based on simple linear combinations of intra-individual contrasts. 
Since the design is nearly balanced, closer agreement might be expected, 
even though our “‘least squares’’ analysis fails to allow for time-period 
effects. In fact, contrary to the suggestion in [1], the “linear combina- 
tions” analysis is not really comparable to the “least squares” analysis. 
The reason is that in estimating, say, D — 7, , no account was taken 
of the information about this difference given by contrasting the direct 


estimate of D — T; with the estimate of T, — T;. If we take an 
average of the directly observed difference, D — 7, , with the contrast 
obtained by combining D — 7 with 7, — 7; , weighting inversely as 


the estimated variances, and proceed similarly for the other compari- 
sions, we get Table IV, which agrees much more closely with the “least 
squares” analysis (Table 4a in [1]). As might be expected, the “‘ex- 
tended least squares’’ analysis which does allow for time-period effects 
‘(Table I) gives results in excellent agreement with Table IV. 

This revision leads in turn to a more precise combined intra- and 
inter-block analysis in place of that shown in Table 8c in [1]. The 
result (not given) is quite close to the “least squares’’ combined analysis. 
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TABLE IV 
Drug CoMPARISONS FROM ANALYSIS OF WITHIN PATIENT CONTRASTS 
Comparison Mean Difference Variance t 
D vs. T; 3.449 1.206 Bet bed 
D vs. T3 2.051 1.073 1.98 
Tis VSenLa 1.398 1.031 1.38 


Both Curnow and Bross pointed out that erroneous entries are 
shown in Table 4c in [1]. This table should be replaced with Table 4c 
(revised) as shown. 


TABLE 4c (revised) 
ComBINED INTRA- AND InTER-BLock ANALYSIS FOR DRuG DIFFERENCES 


1 (aye aes 
— L174 = 0.8518 = 3700 ~ 0.2688 


Calculation of Average Mean Difference: Drug D vs. Drug 7: 
(w)(3.150) + (w’)(3.900) 


wtw’ 


= 3.330. 


Calculation of Variance of Difference 


1 1 


———<— = ese = 0.892. 
wtw’ 0.8518 + 0.2688 
Druce CoMpaRISsONS 
Comparison Mean Difference Variance t 
D vs. T; 3.330 0.892 3.52 
D vs. T3 1.843 0.848 2.00 
T; vs. Ty 1.442 0.828 1.58 


DISCUSSION 


The point which we wished to emphasize in [1] was that on account 
of the increase in duration of relief with time of administration, the use 
of simple intra-patient contrasts—‘each patient his own control’’—may 
not be optimal, and that an experiment in which each patient received 
only a single drug might be preferable. 


METHODOLOGY IN STUDIES OF PAIN RELIEF 583 


The analyses proposed by Curnow and Mosteller have demonstrated 
that for this data a model taking account explicitly of the effect of time 
of drug administration will yield more efficient estimates than does the 
one-way analysis of the first period only. With respect to the analysis 
of the present data, both the Curnow and Mosteller models appear to 
fit quite well, and the estimates of the drug effects produced by their 
analyses are more precise than those derived from the analysis for the 
first period only. 

It is not clear, however, whether the above finding should be con- 
strued as evidence supporting the need for more complex models and 
analyses than are currently in vogue, or whether instead it is evidence 
supporting our original viewpoint. If, as the evidence of Curnow’s 
analysis suggests, patient differences are negligible, we would have no 
reason at all to use each patient as his own control. Were each patient 
given only one drug, the time-period effect of Curnow’s model would be 
eliminated from treatment contrasts and the one-way analysis—using 
now the whole of the data—should be fully efficient. 

Even if patient differences are not negligible, the AGH of 
simplicity, both of design and analysis, may outweigh a small gain in 
efficiency which could be achieved with the use of a more complex 
design and analysis. With our data, for example, the estimates of 
greatest apparent precision were those provided by the regression 
model, and the gain in precision compared to the analysis of the first 
period only was 50 percent. Such a gain is, of course, quite worth 
having, but it is not overwhelming, and it is easy to believe that had 
our design assigned one drug to each patient throughout the period of 
study, the one-way analysis might have equalled the present regression 
analysis in precision. 

In conclusion, we must again emphasize that we do not claim that 
our findings apply to all kinds of pain studies. There may be many 
situations for which within-patient contrasts are far superior to between- 
patient contrasts. What we do claim is that no principal such as 
“each patient his own control’’ is entitled to the status of dogma. 
In some situations, at least, the simpler methods are better. 
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SUMMARY 


This paper discusses the interpretation of frequency data when the 
series of observed frequencies is assumed to arise from a population in 
which the expected frequencies form a geometric progression. Such 
situations occur in the study of steadily increasing insect populations 
and similar phenomena, where the common ratio of frequencies is related 
to the rate of growth of the population. 

The estimation of the common ratio, and tests for the significance of 
departure from geometric trend, are discussed. Asymptotic formulae 
for the tail probabilities in large samples are determined. 

The methods and tests of significance are illustrated by application 
to some experimental data. 


I. INTRODUCTION 


Observations are often recorded by classifying them into several 
classes and counting the frequencies of occurrence in each class. In 
general, the expected frequencies will be partly specified by theoretical 
considerations, but will often also depend on one or more unknown 
parameters. 

The interpretation of such data then involves, firstly, testing its 
concordance with the assumed form of expected frequencies, and 
secondly, estimating the unknown parameters. Thus, for instance, in a 
steadily increasing population of organisms, the number of individuals 
expected in successive age-groups will be in geometric progression, the 
common ratio between the expected numbers representing not only the 
growth of the population but also the effects of mortality, migration 
and other factors. In such a study, one of the objects would be to test 
the concordance with the assumed geometric progression, and the other 
would be to estimate the common ratio accurately. 

In general problems of this kind, the parameters may be estimated, 
and the accuracy of the estimates assessed, by the method of maximum 
likelihood. To test the fit of the model to the data, the x’ test is generally 
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appropriate. If hypothetical values of the parameters have been 
specified, as occurs in many practical instances, their concordance with 
the estimates derived from the data can also be tested by means of x’. 

In the particular case, with which this paper deals, of expected 
frequencies in geometric progression, there exist sufficient statistics for 
the two parameters involved, representing the general level of the 
frequencies and their geometric trend. This facilitates estimation and 
significance testing, although, since the equations of estimation are in 
general non-linear, iterative methods are required in the arithmetical 
work of calculating estimates. 

The need to fit frequencies in this way arose from a method, devised 
by Dr. R. D. Hughes, Division of Entomology, C.8.1.R.0., of using 
the age distribution in an aphid population to study its rate of growth 
in the field. The device of using the proportions of individuals in the 
immature instars in a field population of aphids to give, firstly, the age 
distribution, and secondly, the rate of increase of the population, is 
the subject of a forthcoming paper by Hughes. In the present paper, 
some preliminary observations of instar distribution under controlled 
experimental conditions are discussed. 

Chapman and Robson [1960] have considered the age distribution in 
a stationary population subject to constant mortality. As this formula- 
tion leads to a geometric progression of expected frequencies, many of 
their results anticipate results given in this paper. However, the 
general objects and scope of the two papers are different. 

Aphid populations under uniform and favourable conditions increase 
at a constant rate, so that their expected numbers at equal intervals 
of time form a geometric progression. However, the numbers even of 
an initially small population quickly become too large to be counted 
accurately. An estimate of the rate of growth of the population may 
then be made from the age distribution. The immature aphid passes 
through four instars before reaching maturity. In a stationary popula- 
tion, the number expected in each instar is proportional to its duration. 
lor some species of aphids, for instance A. craccivora, the average 
durations of the first three instars are probably equal under constant 
conditions, so that equal numbers will be expected in each instar in a 
stationary population. When the population is increasing at a constant 
rate, the numbers in each instar will approximate to a geometric pro- 
gression. 

If the common duration of each of the first three instars is c, and 
the growth-rate of the population is p, then the ratio of the number in 
each instar to that in the preceding one is approximately 


Coes 
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In this result, mortality has been neglected, since it has been shown 
that under favourable conditions mortality in the immature stages is 
negligible. 

By taking a random sample from the aphid population and de- 
termining the number in each instar we can first check the validity of 
the assumption of uniform growth by testing whether the numbers are 
consistent with a geometric progression. Having satisfied ourselves 
on this point, we can then estimate the common ratio of the numbers. 
If c, the duration of each instar, is known, the growth-rate can then 
be determined. 

If the durations of the different instars are different, the above 
method will be modified, and the estimation of the growth-rate is a 
little more complicated; in principle, however, such data will still 
provide information from which the erowth-rate can be determined. 


II. THE BASIC DISTRIBUTION 


We consider a sample of N individuals classified into k classes, the 
observed frequency in class 7 being n; , with expected value 
E(n;) = u' (GON I, ecw dee): 
If N is assumed to have a Poisson distribution, so has each of the n, ; 
the joint probability density is then 
pth a ebiig 
Ny ! ONY ire) ie 


Pin hie ,™%-1) = 1a (er Nan, = ihe | 
O- 
where 
TS 1 + ty oe a ie I)nz-1 , 
and 
bon) = Uae ee ame 


From the form of the density it is apparent that N and T are a pair 
of sufficient statistics for the parameters \ and ». Thus, questions of 
estimation may be referred to the joint distribution of N and T. 

By summing the probability (1) over values of the n,; leading to the 
given values of N and 7, we find the joint distribution of N and T as 


CW, T) 
N! 


where C(N, 7’) is a numerical coefficient. 
The number N has a Poisson distribution with mean Ado(u); 


P(N) = &*™ [Ago(u)]"/N1. 


RNG epee Nias (2) 
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Hence for given N, the conditional density of 7’, which is independent 
of X, is 


PT | N) = CW, T)u"/[bo(u)]". 


This is a special case of the distribution discussed by Noack [1950] and 
described by him as a power-series distribution. Our [o(u)]” takes 
the place of his f(z). From this expression for the density we see that 
C(N, T) is the coefficient of u7 in ¢)(u)”. This fact enables the numerical 
coefficient to be determined directly. 

This conditional density of T provides the basis for estimates and 
tests of significance about un. 


II. THE COMBINATORIAL FACTOR 


The coefficient C(N, T) is seen to be a generalization of the binomial 
coefficient (for the positive binomial, when k = 2; for the negative 
binomial, when k = @). It is the number of ways of allocating T like 
objects among N different cells, none of which may contain more than 
k — 1 objects (see Riordan [1958], page 104, where some recurrence 
relations and moment formulae are also given). 

The coefficients are generated by the function 


wor = (=#)"-[o Pt Tels (").-10 


On equating coefficients of u” we find 


a NDan  ya ga) ten ater) 
7) Gee ai Ee 


The series has 1 + [7'/k] terms. This expansion in terms of binomial 
coefficients is useful, especially if & is large, since the first few terms 
then give a close approximation. 

Because of symmetry, C(N, T) = C[N, (k — 1)N — T] so that 
results for 7 > 3(k — 1)N can be found from those for T < }(k — 1)N. 

From the generating function or otherwise we may also deduce the 
recurrence relation 

COON AT ae GC (Ne ie hee ICN 3 1) — CON =" 1) aah) 
which is useful for computation. 

Hitherto we have been considering relations among the C(N, 1’) 
for a fixed value of k. We now consider relations for different k, and 
shall indicate by a suffix the number of classes. 
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We may express the coefficients for k classes in terms of the co- 
efficients corresponding to the factors of k, by means of the following 
reduction formula. 

isha 01g. 


k i= yy i 


oa 4 = = aS hop(u) “boo(u’) 


] = 
dole) = 7 : 


Hence 
CONS TY = CHN, 2D) 4 GN, = Pea 
oC UNG TP =" 2C UNS 2) Tae 


a series of 1 + [7/f] terms. The terms of these series are all positive, 
and generally smaller than those of the series (3) of products of binomial 
coefficients. They may have some advantages for computation, pro- 
vided the coefficients for f and g classes are known. 

By differentiating the generating function with respect to its param- 
eter we may prove recurrence formulae of the type 


ON (k — a 50 (4) 
Sk = DOW = 1, Ty GS CW =a at) 
+ .1 =— (& — DCN —1,T—k + I): 
When k = 3, many particularly simple recurrence formulae may be 
established. Recurrence relations when k = 3 are 
CA, Tee 


T(2N — T) 
{QN — DOIN —1,T — D-E3W = NCW — 2,7 — 2] 


= <a [(2N —T + )CWW,T — 1) + 38NC(N - 1,T — 2)] 
N OY T ry Y T 7 


the last being a particular case of (4). As k increases the recurrence 
formulae become more complicated. 


IV. MOMENTS OF THE CONDITIONAL DISTRIBUTION OF T 


The moments and cumulants of the conditional distribution of T 
are of general interest, and will also be of use when the determination 
of probabilities in the tails of the distribution is being considered. 
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Since the cumulants for a sample of N are simply N times those for a 

sample of 1, we shall consider a sample of 1, for which 7 equals X, the 

number of the class (0, 1, --- , k — 1) in which the observation falls. 
Clearly, if 


o-(u) = (: Y's.) = br uM, 


then the rth moment about zero is ¢,(u)/o(u). This result is equivalent 
to the results of Noack [1950], though expressed in a slightly different 
manner. 

However, unless k is small, these expressions are not convenient 
for the computation of the central moments and cumulants, which are 
more simply found directly, 

The moment-generating function of X is 


Ee’*) = SS ge", > Ke = do(ue’)/do(u). 


Thus the cumulant-generating function is 
K(s) = log ¢o(ue’) — log do(u) 
—log (1 — ue’) + log (1 — p'e") + log (1 — uw) — log (1 — p') 
(a rk 
Dig a mee Ae ai 


From this expansion we derive the particular results 


I 


I 


( Suctia ky" 

ee) ii ena? 

Tred Peete Ae” 
See ee 

ee aie ek ee) 
i 

same ae eG dy in) 
: (1 — p)* Ch Sn 


The form of the cumulant-generating function shows that, if u, is a 
geometric variable with parameter u*, then X + ku, = uw. It also 
follows that, if u,(N) is a negative binomial variable with parameter 1“ 
and index N, then 7 + ku,(N) = w(N). 

When yu = 1, the distribution is the uniform discrete distribution, 
and the cumulants are expressible in terms of Bernoulli’s numbers. 
We then have 


Ky 
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se ei 
) + tox (° sk 


ei . Biss ona a See 


e — 1 
ane 


where s/(e’ — 1) = 1+ SG e®* in symbolic form. Hence 
x, = (kK — 1)B,/r;-in particular, « = (k — D/20 = = ye 
ka = 0, ke = —(k& — 1)/120. 


Relation between Cumulants Corresponding to u and jie 


When the series of frequencies is reversed, the ratio u is replaced 
by its reciprocal. It therefore follows that, when yu is replaced by poms 
the odd cumulants other than «,; are simply changed in sign, and the 


even cumulants are unaffected. We have 
Cg) le), 
(uw) = (—1)"«,(u) (ial): 


These results may also be readily verified from the form of the cumulant- 
generating function. 

Because of these facts, we need not consider values of » outside the 
range (0, 1). 


V. ESTIMATION OF THE RATIO OF FREQUENCIES 


In the conditional distribution, T is a sufficient statistic for the 
parameter », the common ratio of the expected ferquencies. Thus the 
estimation of » is straightforward. However, since the equation of 
estimation by the method of maximum likelihood is non-linear, iterative 
methods will be required to solve it. 

The logarithm of the conditional probability of T, apart from terms 
independent of the parameter, is L = T log » — N log ¢o(u), and its 
first derivative with respect to u is 


(T/u) — (No1/ud0)- (5) 


Equating the derivative to zero gives the maximum likelihood estimator 
of u. We indicate by an asterisk the maximum likelihood estimator and 
functions of it. Thus 


T — (N¢%/¢%) = 0. (6) 


Since the equation is linear in 7’, it is clear that the estimator is to be 
found simply by equating 7 to its expectation: T = Nx* , as may be 
verified from the results of the previous section. 
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The variance of the derivative (5) gives the information J, about 
in the sample. Being linear in T’, the derivative has variance 


aN al N Kb ku 
V(T)/u Nko/p = Yap - ee) = 

This result is a special case of the results of Patil [1961], who investi- 
gates the estimation of the parameter of a generalized power-series 
distribution. In large See the reciprocal of J, approximates the 
variance of the estimate u*; that i is, V(u*) = w/Nro . 

For purposes of calculating and tabulating the solutions of the 
maximum likelihood equation, we shall put 


Tih — DN =, 


so that; for allk, 0 < » < 1. 
Then the estimating equation may be written 


ge ine ee ; 
1 1l—up 1—, )o. (7) 


An alternative form is 


1 k 
[ee ae DL 


or 


—k 


k 
eee 


“" ig the root 


Thus, if u* is the root corresponding to v, then u* 
corresponding to 1 — v. In particular, if v = 4, u* = 1. 

Since the roots corresponding to values of v exceeding 4 are the 
reciprocals of roots corresponding to values of v less than 4, we may 
henceforth confine attention tov < 4,4 <1. 

Equation (7) can be solved iteratively, once an approximate value 
of uw has been chosen. If v is not too near 4, and k is not small, a first 
approximation is 

(k — 1)v T 


mM" 1T+h—-Do T+’ . 


and a second approximation is 
k 
Me = wy + kui oe oe 


The difference between the two sides of (7) when an approximate 
value of » is substituted represents —y/N times (5), the first derivative 
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of the likelihood. The adjustment to u is given by the ratio of this 
difference to —yJ,/N. Then, approximately, 


ares ne a sea 


Kg 


The substitution and adjustment may be repeated as often as required 
to give the desired accuracy. 
If v is close to 3, an approximation alternative to (8) is to be pre- 


ferred. We put 


-—6 
mee, => — 


Then equation (7) becomes 


or, symbolically in terms of Bernoulli’s numbers, 


ze" ba? eek =(k— 1) — w). 


We then find 


3 

a (k? — 1) + ne (k* — 1) + 0(6) = (ke — Du, 

whence 
_ 120 144 (kh? + 1)u* , 
Ral mcerian reerpeeyn ec 

and 

Pelee 72w 144 (kh? + 11)u* 3 

wT EL Ge De 8 ae 


Solutions of the maximum likelihood equation for various values 
of k and v are given in Table 1. Once values of u are given, it is easy 
te compute J, . As we shall see in Section X, the solution not only 
gives a point estimate of the ratio u, and an approximate standard 
error based on the information function J, , but also gives a means of 


determining tail probabilities, needed in making significance tests and 
setting confidence limits. 


VI. ESTIMATION OF 


In general, for the problems considered in this paper, the actual 
value of \, representing the size of the population (or rather, of the 
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TABLE 1 
Maximum Like.inoop Estimate oF pu [with ARGUMENT v = 7'/(k — 1)N] 


k 2 3 4 5 
0.00 0 0 0 0 
0.05 052632 092894 131334 167119 
0.10 111111 178395 238358 291010 
0.15 176471 261940 333333 392939 
0.20 250000 346500 422530 483323 
0.25 333333 434259 509668 567737 
0.30 428571 527202 597388 649654 
0.35 538462 627431 687922 731617 
0.40 666667 737405 783468 815815 
0.45 818182 860221 886484 904443 
0.50 1.000000 1.000000 1.000000 1.000000 


zero-class), is not of interest. However, for testing the significance of 
departure of the frequencies from a geometric progression, estimates of 
the expected frequencies in each class will sometimes be required. 
We shall then be interested in the simultaneous estimation of \ and M, 
based on N and T. We therefore consider this question, rather than 
the estimation of X alone. 

The joint probability of NV and T, as given in (2), is 


CN, hi) —Ado(u)\N T 
ea 
so that the logarithm of the likelihood, considered as a function of the 
two parameters, and with constant factors ignored, is L = —)d¢o(u) + 
N logA + T log pw. The derivatives are 
[Lh = —o(u) a (N/), (10) 


L, = [=—Ad,(u) + TY/u. 


The estimates are given when these derivatives are equated to zero, 
or when N and 7’ are equated to their expected values. For direct 
solution, \ can be eliminated between the two equations, giving an 
equation for uw identical with (6). w having been found, \ may then be 
found by substitution in (10). 

For simultaneous solution, and also to give a measure of the infor- 
mation about » and yp, we require the covariance matrix of the deriva- 
tives. Noting that N is a Poisson variate, with variance equal to its 
mean A¢o(u), that the covariance of N and T is d¢,(u) and that the 
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variance of 7’ is \¢2(u), we find for the covariance matrix of the deriva- 
tives 


ae le s/n) 
:/M b2/u 


The inverse of I, giving approximate variances and covariance of 
the estimates in large samples, 1s 


a penne Oe Nps — Auda | 
Moods is $;) —)udy udo 


From trial values \, » of the parameters we find improved estimates 
by means of the equations 


een 
ie m 15 


which give explicitly 


yx — Noo = Tor 
Dobe a Pi (11) 
No — He. 
nT fey eee ea 
fC aera | Nob. — $1) 


Note that the trial value of \ does not appear in the equation for 
\*, since the estimating equations are linear in \*. The adjustments 
given by (11) may be repeated to give the accuracy required. 

The expected frequency in class 7 is \u*; the variance of the esti- 
mated frequency may be determined, using I’, as 


pbs — oi 
The relative variance of the estimated frequency is therefore 
Qn 2161 oa Vo. 
Noobs a: $1) 


VII. TESTS OF DEPARTURE FROM PROPORTIONALITY 


(bo or 21d, = 1 bo). : 


Before any use is made of data to which proportional frequencies 
have been fitted, it is necessary to test whether the observations depart 
significantly from the estimated frequencies. 

Following Cochran [1954], we shall denote by X”’ the statistic used 
for testing the discrepancy between observed and expected frequencies, 
and by x’ the random variable with the familiar distribution, to which 
the distribution of X° approximates. We consider here the adequacy 
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of the x’-approximation in testing X” for departure from expectation 
in the present problem. 

Alternatively, we may use the fact that, because N and T are 
sufficient for \ and yu, the probability of a sample, conditional on N and 
T, is independent of the parameter values. An exact conditional test 
can thus be made, provided there are several possible samples corre- 
sponding to the given values of N and 7. The probability of a sample 
of values %>) , 2, °-** , Ne-1 With Ds mn, = Nand Ds in; = Tis 


ne 
iT (n; !) : 
Likewise the joint probability of N and T is 


CW, T) 
N! 


Ado (u) NT 
jE 


po (uy NT 
€ Pa Naliet, 


where C(N, T) is as defined in (2). Hence, the probability of the sample 
conditional on N and T is 


N! 
CONTE [Gt 


By enumerating all possible samples and their probabilities we can 
decide the significance of an observed sample. Often the set of samples 
deemed to show significant departure from proportionality will be taken 
as a set of the samples with the smallest probabilities; however, in 
many problems, other criteria may be considered more appropriate. 
The use of X” has the advantage that it orders the samples according to 
a quantitative measure of departure from proportionality, whereas the 
probability attaching to a particular sample may be small merely be- 
cause of the discontinuity of the distribution. A satisfactory arrange- 
ment would therefore be to order the samples by means of X’, but 
to use the exact probabilities. In many cases the exact cumulated 
probabilities will not differ much from those given by the x’-distribution. 
The questions of calculation of probabilities and of choice of the signifi- 
cant set have been thoroughly discussed by Fisher [1950]. 

As an example of the exact calculation, we consider the following 
data in four classes: 


(12) 


Observed Expected 
No 30 224 
Ni 5 9 
Ne 2 3 
N3 3 il 
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It will be seen that, because the numbers are restricted to four classes, 
the possibilities are much fewer in number than in other examples, 
such as that of the testing of deviations from a Poisson distribution, 
as discussed by Fisher. For this set, N = 40, T = 18, and it is readily 
verified that u* = 3, \* = 27. X’ is easily calculated as 


2 2 2 2 
Toy (Mey yn ae ee aN 
apn Sg cicna cae N 0445, 


however, since the expected values in two of the classes are small, 
the tabulated distribution of x’ is not likely to be a good approximation 
to the distribution of X’ for this sample. For this reason, as an example 
of method, the exact probabilities have been determined for the 37 
possible samples having values N = 40, 7 = 18. The probabilities 
and the values of X” are given in Table 2. 

We have C(40, 18) = 220,495,831 X 10°. 

From the Table it is seen that the sample in question just attains 
significance at the 5 percent level; this is so whether the samples are 
ordered according to their probability or by X°. The tabulated distri- 
bution would give a probability, for x° with two degrees of freedom, 


eX" = e 3? — 0.039866. 


This shows that, for this example, the tabulated distribution under- 
estimates the significance probability only slightly. On the other hand, 
for the sample 


aoe ae 
for which the probability accumulated according to X” is 0.008190, 
Xe = 8,595, 
the probability of exceeding which is 
0.013619. 


The probability here is overestimated somewhat. 

As a matter of interest, the mean and variance of for the distribution 
generated by this set are H(X°) = 1.9956, V(X*) = 3.4223, compared 
with the values of 2 and 4 respectively for the x7-distribution. The 
low variance will usually lead to overestimation of probabilities at the 
tails of the distribution and underestimation of significance, when 
referred to x’, though this effect will sometimes be masked by local 
irregularities of the distribution. 


VIII. MOMENTS OF THE NON-PARAMETRIC DISTRIBUTION 


We now show how the moments of the exact distribution of X” 
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TABLE 2. 
THE FREQUENCIES IN Four CuassEs FOR WHICH N = 40,T = 18 


Cumulative 
no ny No Ns Probability probability 27 x? 
34 0 0 6 .000000 .000000 1048 
31 0 9 0 .000001 .000001 610 
33 0 3 4 .000003 .000004 522 
33 1 1 5 .000004 .000008 696 
32 0 6 2 .000010 .000018 376 
32 3 0 5 .000019 .000037 646 
31 1 vi 1 .000089 .000126 352 
32 1 4 3 .000098 .000224 334 
32 2 2 4 .000146 .000370 424 
30 2 8 0 .000173 .000543 408 
22 18 0 0 .000514 .001058 376 
31 4 fi 4 .000781 .001839 370 
30 6 0) 4 .000807 .002646 360 
31 2 5 2 .000938 003584 226 
31 3S 3 3 .002083 .005667 232 
30 3 6 1 .003229 .008896 198 
29 4 7 0) .003460 .012356 250 
24 15 0 1 .004561 .016917 198 
28 9 0 3 005574 022491 190 
23 16 1 0 .006841 029332 226 
26 12 0 2 .009578 .038910 136 
30 5 2 3 .009688 .048598 174 
30 4 4 2 .012110 .060708 120 
29 7 1 3 .013840 .074547 160 
28 6 6 0 .023412 .097959 136 
29 5 5 1 .029063 . 127022 88 
24 14 2 0 .034206 . 161228 120 
25 13 1 1 .038311 . 199539 88 
27 10 1 2 046824 . 246363 66 
29 6 3 2 048439 294802 58 
Dit 8 5 0 .070236 865038 66 
28 8 2 2 .075253 .440291 40 
25 12 3 0 083006 . 523297 58 
28 Uf 4 1 . 100337 623634 22 
26 10 4 0) . 105354 . 728988 40 
26 Tal 2 1 114932 8438920 22 
27 9 3 1 . 156080 1.000000 0 


may be determined. This distribution is independent of the population 
parameters, and depends only on the values of N and 7. 
We consider first the conditional moments of the n; . I'rom ex- 


pression (12) we see that 
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CIN, T) _ 


1 
N! eae re 


where the sum is taken over all sets of the n; such that > n, = IN and 
>> in; = T. Now the expected value of 7; is 


N! N; 
C(N, T) d Il @: 2 


In the sum, the total of the arguments, with 1; replaced by n; — lI, 
is N — 1, and the weighted sum of the arguments is T — 7. It follows 
that E(n;) = NC(N — 1, T — 7)/C(N, T). By a similar method it 
follows that, if >) a; = A, and > ia; = B, then the general factorial 
moment is 


Bf TT @: | NI CV — A,T —B) 
Th Ge C(N, T) 


Thus in principle the joint factorial moments of the n; can be de- 
termined provided we can regard the C(N, 7’) as known. Some dis- 
cussion of the asymptotic representation of C(N, T) is given in Section 
X, but more needs to be known about these coefficients in general. 

For the test of departure from expected frequencies we have 


2 
N; 


aon 


Now \* and p* are functions of N and T only, so are fixed for the con- 
ditional distribution. Hence X” may be simply regarded as a weighted 
sum of squares of the n,. In particular, since 


INN =O Nie 2 Ea 20) NC NESS ree 


2 
ac CW, T) 

we have 

N 
4 2 —_— EEE 
OY CNP) 

N CN = eno Stele 
os! CIN — 2, BB ES (tes ieee 


higher moments may be found similarly. 


IX. TEST FOR TREND IN FREQUENCIES 


Of particular importance is the test for the reality of any apparent 
trend in the frequencies—that is, whether the estimated value of u 
differs significantly from unity. When JN is large, the test may be 
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made by comparing the difference 1 — u* with its standard error, or, 
what is equivalent, testing 4(k — 1)N — T against its standard error. 

When y» = 1, the variance of u* reduces to 12/(k” — 1)N, so that 
the test statistic, distributed approximately as x” with 1 degree of 
freedom, is 


(k* — 1)N(1 — y*)?/12. (13) 
More directly, the variance of T is 
[(k* — 1)N]/12, 

so that an alternative test statistic is 

12(3(k —-1)N -—TY/(k? — DN. (14) 
The statistics (13) and (14) are equivalent in large samples since, 
when u* is near to unity, we have 
EE) 

(ky — DN 


wt = 1 + OW”), 


as can be seen from (9). 

The exact significance probability is given by the probability of a 
value of 7 less than or equal to that observed, which is easily computed 
directly when N is not large. This probability should be doubled, 
since both tails of the distribution are relevant to this test. With 
uw = I, this probability is simply 


tif 
DS CIN Ak = ISN Lee: 
Now the generating function of S(N, 7’) is 


[ow "/A — ») = 1 — 2/0 — »)*. 


From this representation we readily deduce that 


won 2) -(YETED 
+ we a ai) She = aD) 


which provides the most convenient means of computation of isolated 
values of the sum. On putting 7 = (k — 1)N in (15) we have the 
interesting corollary 


2) 9) + 8) =n 
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Values of T less than 3(k — 1)N required for significance at the 5 
and 1 percent levels of probability have been determined for various 
values of k and N, and are presented in Table 3. For values of NV beyond 
the range of the Table, the large-sample x’-test may be used. 


TABLE 3 


Exact Trst ror EXISTENCE OF TREND 
(1B. or Nutt Hyporuesis » = 1) 


5 Percent Point or T [or (k — 1)N — T] 


k 
N 3 4 5 
5 1 2 3 
6 1 3 4 
7 2 4 6 
8 3 5 7 
9 3 6 9 
10 4 a 10 
11 5 8 12 
12 6 9 13 
13 6 11 15 
14 i 12 17 
15 8 13 18 


1 Percent Point or T for (k — 1)N -- T] 


k 
N 3 4 5 
5 0 0 1 
6 0 1 2 
7 1 2 4 
8 1 3 5 
9 2 4 6 
10 3 5 8 
11 3 6 9 
12 4 7 ilu 
13 5 8 12 
14 5 9 14 
15 6 11 15 


FITTING A GEOMETRIC PROGRESSION 601 


X. EVALUATION OF TAIL PROBABILITIES 


When WN is large, the calculation of the probabilities of the tails 
of the distribution is troublesome. Blackwell and Hodges [1959] have 
given a method for expeditiously finding approximate tail probabilities. 
The method depends on a transformation of the distribution to a 
new distribution for which probabilities in the neighbourhood of the 
mean are to be determined. Daniels [1954] has given equivalent re- 
sults, expressed explicitly for continuous probability densities, and 
Good [1957] has given similar results to the term of order N” of the 
leading term. 

We briefly outline Blackwell and Hodges’ results applied to the 
present distribution. We make use of the fact that the distribution 
of T is the N-fold convolution of the distribution of X as defined in 
Section IV. 

The moment-generating function of X — a, where a is any constant, 
is M(s) = Efe’"*~”]. 

Let s* be the unique value of s that minimizes M(s), and denote 
the minimum of M(s) by m(a); note that s* is a function of a. Then 
we transform to a new variable Y, for which 


P(X = ae"? 


P(Y =2) = Aa) 


(16) 


It is readily seen that Y has in fact a probability density with the same 
range as X. Also, by differentiating the numerator of (16) with respect 
to s* and summing, we find that H(Y) = a. 

Before continuing with Blackwell and Hodges’ method, we establish 
the interesting result that, for any distribution for which the sum of 
the values of X is a sufficient statistic for the parameter u, the new 
variate Y has a distribution of the same form, but with parameter u*, 
the maximum likelihood estimate corresponding to the value X = a. 
This result is probably well known, though we have never seen it 
discussed. Thus we see that, for distributions of this commonly occurr- 
ing class, the solution of the maximum likelihood equation is important 
not only for providing a point estimate, but also for evaluating the tail 
probabilities corresponding to the particular observed value. 

If the sum of the values of X is sufficient for u, the density of X 
may be written as 


h(X)e*" /f(u), (17) 
where 


fw) = DX hae. 
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Then 
M(s) AS 3 hae Os 


so that, for any value of s whatever, a transformed density is given by 
h(X)e* "4"! /e* Ms). 
This density is clearly of the same form as that of X. 
Now the maximum likelihood equation, corresponding to an observed 
value a, turns out to be 


& — >> rhe" D> hae” 10; 


zx 


where we have differentiated (17) with respect to g(u) rather than p 
itself. This is in fact H(X | » = u*) = a. Now from (16) we derived 
that E(Y) = a. It follows therefore that the parameter of the distribu- 
tion of Y is u*. 

We can also express m(a) in terms of the maximum likeli- 
hood estimator. On putting x = a in (16), we see that m(a) = 
P(X = a)/P(Y = a), or P(@| »)/P(a | u*). 

For the particular distribution we have been considering, therefore 


ma) = (5) (6) 


where we indicate by an asterisk a function of p*. 


Blackwell and Hodges, by means of the transformation (16), and 
taking Na = T, show that 


P(X + Xe bo + Xv = 1) = [m@)"P, Ya = a ae 


and thence, that each side is equal to 


[m(a)]” | 1 ee 55 as 7 
AORN GN ae 3) t OW) |, 


OF 


yu \N 
w\'(o% 1 | ee a Bi : 
(4) (3) Vans Lt an Ve? > 3 ee) + OO) 


where the «* are the cumulants of the distribution of Y; that is, func- 
tions of *. The explicit terms in (18) give a satisfactory approximation 
to P(T | N), provided WN is large. 

We note that the series in (18) depends only on y* (ie., on 7’), and 


not on w. Thus what we have obtained is an asymptotic expansion of 
CW, T). In fact, 
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4N ok ok? 
oo E tox we = a) % ow |. 


BehzusS 3 
wr 2a kk ars 


Thus, C(NV, 7) may be estimated by substituting the actual values of 
the cumulants, from the formulae given in Section IV. 

As a check on the asymptotic formula, we estimate C(N, T) for 
the values k = 4, N = 15, T = 11. We deduce p* = 3, d& = 15/8, 


m= 104/15: 107862222. 
kX = 2842/15° = 0.842074, 
1434/15* = 0.028326. 


a 
ny 
l| 


Hence 


2 15 (15/8)” 1.805604 
Glib. F1).43 on 194 at (1 _ 1.805604) = 2,784,963. 


The true value is 


2a (ey Ce eee 
& SN Gg Mt ed ae 
from which the estimate by the asymptotic formula is in error by 1380 


per million. 

For significance testing, the cumulative tail probabilities are more 
important than the individual terms. These cumulative probabilities 
are also given by Blackwell and Hodges. They give 


____[m@]" | By i fe Gy 


zZ Gs — 2 +040 + 2) s ow" | 


7 2N KE?(1 — 2)" 
P=) ah eee k less z)+«4(1 + 2) SS | 
eae E Ni ena ee 


where, for the distribution considered here, z = »/u*. We note that, 
as is to be expected, the bracketed series in the cumulative probability 
depends on ». The approximation is valid only for »* > yn, and is 
most effective when z is small—that is, when y* is large and the extreme 
tails of the distribution are being considered. 

When we are determining the probability in the lower tail of the 
distribution, that is P(T < 1), so that T < i(k — IN, and p* < g, 
we simply replace z by ¢ throughout the formula, and change the 


sign of x%. 
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As a test of the adequacy of this approximation, we apply it to the 
example for which the individual term was calculated above, namely 
k=4,N = 15,7 = 11. 

Inserting the values found above, we have z = 2, 


P(T = 11) 
1 
(: . 2) 

S| = | a | 
oe AG oo ie eo? ~2 
60u 194? ( es a | 

15° 2u 
We consider the adequacy of the approximation when » = 1, for which 
we can readily determine the exact probability. We have 
PO = 11) = 23784 600 /4 
so the leading term in the approximate probability is 
2P(T = 11) = 5,569,200/4!> = 0.005187. 


For the first adjustment we find the factor 1 — 0.078223 giving 
5,133,560/4"° = 0.004781. The exact value of the denominator is 


sas, = (2) ~()(2) + ()(8) = san, 


giving the exact probability 0.004893. 

We see that the first adjustment gives an approximation adequate 
for the purpose of assessing the significance probability. Since both 
tails of the distribution are relevant to any test of significance, the 
significance probability is 0.009786; 7 = 11 is thus the 1 percent point 
of the distribution. 

By determining the tail probabilities corresponding to any given 
value of » we can in principle set a confidence range for » — the set of 
all « which are not discordant with the observed values N and YT. 


P(T < 11) = 


XI. APPLICATION TO EXPERIMENTAL DATA 


The data in Table 4 show numbers in each of the first three instars 
of immature cowpea aphids (A. craccivora), from samples drawn on six 
different occasions from the same population. Since the experimental 
conditions were uniform, and the average duration of each instar is the 
same (about 42 hours at 20°C), the expected numbers are in geometric 
progression. The Table also gives an overall estimate of the common 
ratio, 0.529820. In order to determine the growth-rate of the population, 
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TABLE 4 
Acs-DIsTRIBUTION oF ImmMaTURE CowPEA APHIDS (A. craccivora) 
Instar 

Sample Ti Il Ill N dt a fae xe 
1 181 92 51 324 194 179.730 0.526015 0.11 

2 148 78 42 268 162 147.737 0.531518 0.01 

3 130 54 43 227 140 123.456 0.543412 4.07 

4 7 37 21 128 79 69.580 0.543848 0.03 

5 88 41 20 149 81 87.710 0.474049 0.13 

6 85 42 28 155 98 82.857 0.558629 0.63 
Total 702 | 344 | 205 | 1251 754 690.958 0.529820 2.14 


we equate this ratio to e °’, c being the average duration of instar and p 
the growth-rate. 


We then find 
cp = —log (0.529820) = 0.6352 
so that 
p = 0.6352/42 per hour = 0.01512 per hour = 0.363 per day. 


0.363 


The population grows at the rate of 100(e — 1) = 44 percent per day. 
Table 4 also gives estimates of \ and yu for each sample, and X’ 
with 1 degree of freedom measuring departure from the common ratio. 
Only the largest X°-value, for sample 3, attains the 5 percent level of 
significance, so that there is no evidence for departure from hypothesis. 
The variance of the estimate of » from a sample of N is p’/V(T) 
which reduces when k = 3 to 
(OA Re) Ora aa 
NC ai rad) 


Using the overall estimate u* = 0.529820 we find the variance of esti- 
mate to be 


V(u* | WY) = 0.510813/N. 
The consistency of the estimates of » from the different samples 


may be tested by X”’. 
For consistency, 


xe (0.526015 X 324 + 0.531518" X 268 
+ --- — 0.529820? K 1251)/0.510813 
= 1.84, with 5 degrees of freedom. 
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The individual ratios differ no more than would be expected by chance. 
After the manner of Section IX, formulae (13) and (14), an alter- 
native statistic for testing consistency is based on comparison of the 
ratios T'/N. 
V(T/N) = wl + 4u + w)/NG + w+ py? = 0.549534/N 
Hence 
X? = (1947/324 + 162°/268 + --- — 754°/1251)/0.549534 


1.33. 


I 
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INTRODUCTION 


Methods for estimating variance components from unbalanced data 
are given in Henderson [1953] for both the random model and the 
mixed model. The calculations involved in the latter case are some- 
what tedious and usually not computationally feasible for data having 
many classes. This paper outlines the analysis for unbalanced data 
from a two-way classification with one classification considered fixed. 
Simplified computing procedures are presented suitable for a large 
number of levels in the random classification and a reasonable number 
of levels of the fixed classification. 


MODELS AND ESTIMATION 


The model for the two-way classification can be taken as 

Sie = er Os BF AOR erin (1) 

where 2;;;, 1s the observation and yp is the general mean. a; is the effect 
due to the z’th level of the a-class, 8; the effect due to the j’th level of 
the $-class, a8;; the interaction and e;;, a random error term. We will 
suppose that the number of a-classes in the data is a, the number of 
B-classes is b, that there are n,; observations in the 7j’th subclass, and 
that s sub-classes have observations in them. All terms except u are 
taken as independent random variables in the random model with zero 
means and variances o2 , 73 , 743 and a7, respectively. The #-classifica- 
tion will be considered fixed in the mixed model, the variances involved 
being 2 , «23, and ao; , the interaction terms being random for the 
particular set of fixed effects occurring in the data. In this case we are 
assuming an underlying multivariate distribution with x; ;, having mean 


1Parts of this paper are based on the thesis submitted by the first author in partial fulfillment of 
the requirements of the Ph. D. degree. 
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value » + 6; and variance of + oo + a : the covariance between 
vi; and 2;;,, fork ~ k’ is o. + oc, and that between 2; ;: and 23%" 
forj ~jandk #k’, iso. 

Variance components can be estimated in both models by equating 
linear functions of sums of squares to their expected values, the sums of 
squares used being reductions in the total sum of squares due to fitting 
various elements of the model as if it were a fixed model. For example, 
fitting (u + a;) results in matrix equations of the form 


Pa = jy, 
where y is the vector of a-class totals x,.. , and @ is the vector of esti- 


mates of the a’s. P is non-singular, a diagonal matrix of order a, of 
terms n;,. Thus 


@ = P'’y, 
and the reduction in the total sum of squares due to fitting the a’s is 
R(u, a) = ay = y’P'y. 
This is the usual uncorrected sum of squares, namely 


R(u, a) = dX Lin (2) 
Similarly 
R(u, 8) = x Leas (3) 
R(u, x, B, a8) = ye x ti;./Nsi 5 (4) 
R(u) = x7../n.., (5) 
and 


RO = Ld Lan. (6) 


The variance components of the random model are estimated by 
equating the expected values to the observed values of differences among 
the above uncorrected sums of squares, namely (2)—(5), (3)-(5), (4)- 
(2)-(3) + (5) and (6)-(4). These cannot be used in the mixed model 
because, apart from (6)—(4), their expectations then contain functions 
of the fixed effects. The within-subclasses sum of squares, (6)—(4), 
can still be used to estimate the error variance as in the random model, 
and o2 and o%,, can be estimated from the differences: 


R(u, Qa, B) a Ru, 8), 
and 


Ru, Qa, B, a8) hs Ru, Qa, 8), 
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whose expectations are free of terms in the fixed effects. (yu, a, 8) is the 
reduction in the total sum of squares due to fitting a and 6 alone without 
the interaction terms. This reduction in sum of squares does not occur 
in the random model analysis and, when required for the mixed model 
analysis it usually involves a considerable amount of computing. Its 
relationship to the random model analysis will now be shown and a 
simplified computing procedure derived. 


SIMPLIFIED EXPRESSION FOR R (un, a, B) 


Henderson ef al. [1959] have shown that fixed effects in a mixed 
model can be estimated by treating the random effects as fixed and 
maximizing the joint distribution function of the observations and the 
random effects. The resulting equations for the two-way classification 


without interaction are 
QO’ 8 Zz 


where P is a diagonal matrix, of order a, with terms n;,. , R is a diagonal 
matrix of order b — 1 with terms n.; , 7 ¥ b, Q is a matrix of order a 
by b — 1 with terms n;; , j ¥ b, y is a vector of the z,.. totals and z 
is a vector of the z,;, totals. R, Q and z’ have b — 1 columns because of 
omitting the equation for the last 8, appropriate to imposing the con- 
straint 8, = 0 in order to have a unique solution. R(u, a, 8) from these 
equations is 


R(u, a, B) = (@’8’) } 


= wz) P ue 8) 
|Q’ R Z 


This expression for obtaining R(u, , 8) requires inverting a matrix of 
order (a + b — 1), which is not feasible in many situations because a, 
the number of random classes in the data, is large. For example, the 
analysis of dairy production records in Searle and Henderson [1960] 
involved 688 herds (random) and 4 age groupings (fixed) so that a 
matrix of order 691 would need to be inverted for equation (8). How- 
ever, because of its special form, it can be reduced to inverting a 3 X 3 
matrix. Thus in the general case for the two-way classification the 
‘inversion of an (a + b — 1) matrix can be reduced to inverting one of 
order (b — 1) and this is a considerable reduction in the computing 
required especially when a, the number of random effects, is large, 


as is frequently the case. 
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The simplification of (8) proceeds as follows. Let 


E OF ines e | (0) 
Q’ aD 
Then 

A= P'+P'QDQP™, 

B = —P''QD, (10) 


D=(R— OF Q)5 
ES 
=po/Pe D Zz 
y’P'y + (z' — y'P''Q)D(z — P''Q’y). (11) 


The matrix P is easily inverted, being diagonal, and the only non- 
diagonal matrix to invert is D = (R — Q’P'’Q)™ of order 6 — 1. 

The first term in the above expression is R(u, a), and the second 
can be derived from equations (7). Eliminating 


P'(y — Q8), 


a 
gives 

6 = Diz — Q’P'y). (12) 
This is easily computed and can then be used to obtain Rg , the re- 
duction in the sum of squares due to fitting the 6’s in this no-interaction + 
model, as 


Rp = 6'(z — Q’P’’y). (13) 
Thus from (11) R(u, a, 8) can be expressed as 
Ru, a, 8) ae Rly, a) sai Re ) (14) 


the first term of which is part of the random model analysis and the 
second term comes from (12) and (18). 


EXPECTED VALUES 
The expectation of R(u, a, 8) can be found by using (8) and (9) 


and writing 
R(u, a, B) = (y’2’) i a 
D 


= (y’Ay + z’By + y’Bz + z’Dz), 
= tr (Ayy’ + 2Bzy’ + Dzz’), 
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where ¢r(A) is the trace of the matrix A, the sum of its diagonal terms. 
After substituting from (1) for the vectors of totals y and z and also 
using (1) in (8), (4) and (6) it can be shown, with a certain amount of 
algebraic manipulation, that the differences used for estimating the 
variance components have expected values 


E{R(u, a, B) — R(x, B)] 

= (n.. — ko)oe a (kg = ke)oas a Ol lo. ’ 
E(R(u, Qa, 8, a8) ae Ru, Qa, 6)] 

= (n.. — kgloug + (s —-a— b+ Nar, 
E{R(0) ao Ru, a, B, a8) | 

= (n.. — s)o. . 

The variances are estimated by equating the right-hand sides of 
these equations to the calculated values of the differences in the square 
brackets of the left-hand side. The term k;, is one of the coefficients used 
in the random model analysis, namely k, = >.>, [>0°-1 7?,]/n.;._ The 


coefficient kz comes from the expected value of R(u, a, 8) and can be 
expressed in the form 


ke = 2ic (AU + 2BV-- DW), (15) 
where U is a diagonal matrix of order a with terms )Jj=; ni, for 
i = 1-:: a, V is a matrix of order a by 6 — 1 vain een De 
for? = 1--- a,andj =1--- b — 1, and W is a diagonal matrix of 


order b — 1 with terms >’, ni’, forj = .1++-b — 1. 


COMPUTING Rg AND kg 


R, is obtained from computing the following terms: 
(i) A square matrix C, of order (b — 1) with terms 


Ci = ey — DE Raa eh Se ome 
and 
Cj. = — > ning [Ne Se Pama Nps re iki = tl Ne 


Computing c,; provides the check that Segice 7; a Ooruallley, ie. 
row and column totals of the augmented C are zero. 
(ii) A column vector r, of order b — 1, whose terms are 


t= 2; — Di Maks... ; Fan oer aa 


Computing 7, provides the check that the sum of all the 7;’s is zero. 
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(iii) A column vector, of order b — 1, of the estimates of the §’s, 
Med Ores 2 
(iv) The “inner product” of the terms in the preceding two vectors is Reg: 


et 


R, = @r = > by; = 1'er. 
7=1 


This is equation (13), the C~’ used here being identical to D. 

Expression (15) for kg is obtained using the elements of C~’, which 
we shall call d,;, , obtained in the calculating of R, above. The special 
forms of the matrices involved in (15) enables kg to be written as 


a b-1 a bt) b=1 a 
hoe Se Sa es eee 
where 
b=1 
X= Dy ts. , 
7=1 


i (ni;/n:.) (XE; = 2h) 
and 
fisge = ismiy/Ni.) Qi — Nig — Mii’)- 

The f’s, of which 3b(b — 1) in number are required, can be calculated 
for each 7 and summed, and the expression is well-suited to evaluation 
on a computer. Desk calculation can be arranged from the same 
formula when b is small, and the number of a-classes (random) is not 
too large. Calculating all of the 3b(b + 1) f’s enables the following 
check to be placed on them: 


NE, Ce feta) = — Di mismes/ns, . (16) 


i/=1 4 


The f-values and >>; d; can be obtained simultaneously with the terms 
of C andr; Rg is calculated as r’/C~’r and kg is obtained using the elements 
of C~* as above. 


EXAMPLE 
The calculation of Rg and kg is demonstrated for the small hypotheti- 
cal example shown in Table 1. 


The steps for obtaining R, are as follows: 
(i) The c-values are 


¢, = 538 — 17/10 — 37/20 — 127/40 — 127/50 — 257/50 
Coo = 45 — 27/10 — 67/20 — 12’/50 — 25°/50 


33.47, 
27.42, 


I 
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TABLE 1 
HYPOTHETICAL EXAMPLE 


Number of observations n;; Mean 
observed 
7 Totals values, 

a 1 2 3 4 Ni. Ls., 

1 1 2 3 4 10 200 

2 3 6 4 7 20 300 

3 12 — 12 16 40 400 

4 12 12 13 13 50 500 

5 BD 25 — — 50 600 

Totals 
Nj 53 45 32 40 hes 
Totals of observed 
values, x ;. 23000 23000 14000 19000 


Css = 32 — 3°/10 — 4°/20 — 127/40 — 137/50 = 23.32, 
C44 = 40 — 4°/10 — 77/20 — 167/40 — 137/50 = 26.178 
C2 = —1(2)/10 — 3(6)/20 — 12(12)/50 — 25(25)/50 = —16.48, 
Gs = —1(8)/10 — 3(4)/20 — 12(12)/40 — 12(13)/50 = — 7.62, 
and similarly 
C4 = —9.37, Cog = — 6.02, 
Coz3 = —4.92, Cra eion 
The check on these values, namely >>°,_, c;,;, = 0, can be seen to hold 


true; for example 
Ci He Ci2 = C3 + Cr Bye Ay ae 16.48 aoe Uy? cera. 9.37 = 0. 
Gi) The r-values are obtained using the totals of the observations 
for the levels of the fixed effects and the means for the levels of the 
random effects: 


r; =23000 — 1(200) — 3(300) — 12(400) — 12(500) — 25(600) = —3900, 


“r= 23000 — 2(200) — 6(300) — 12(500) — 25(600) = —200, 
rs= 14000 — 3(200) — 4(300) — 12(400) — 13(500) = 900, 
r;=19000 — 4(200) — 7(300) — 16(400) — 13(500) = 3200. 


A check on these is that their sum is zero. 
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(iii) 
3947" 16.48) 7.024a 
Cc? =D =| —16.48 27.42 —4.92 
=7:62 —4.92) 23.62) 
(iv) 
Ra t2Dr 


053824 036902 025373 || —3900 | 
(—3900, — 200, 900)! .036902 .063205 .025393 | —200 
025373 025393 .056530/|_ 900 


736703. 


Calculating kg involves the elements of D and the f-values. The 
latter, together with the \,-terms are shown in Table 2. The calcula- 
tions for z = 1 are as follows. 


h, = (1? + 2? + 3’)/10 = 1.40, 
fia = 0.4 + 10 — 2)/10 = .94, 
fio = 271.4 + 10 — 4)/10 = 2.96, 


fies = 3°14 + 10 — 6)/10 = 4.86, 
fie = 1(2)(1.4 ee he 2)/10 = — 32, 
fiag = 18)0.4 — 1 — 3)/10 = —.78. 


Similarly fi.14 = —1.44, fi.23 = —2-16, fies = —3.68 and fis4 = —6.72. 
The sums of these terms, over all values of 7, are shown in Table 2 
from which the f-values can be checked by the expression (16); for 
example 


SS Guns 4 fh fins fa) 


+ 


I 


505.8357 — 360.9718 — 113.1132 — 158.0607 

= —126.31 

—4?(1)/10 + 7°(3)/20 + 16°(12)/40 + 13°(12)/50 
= —(1.60 + 7.35 + 76.80 + 40.56). 

From the c’s and the f’s kg is now computed as 

kp = 45.79 + [505.8357(.053824) + 436.5532(.063205) 

+ 212.4332(.056530)] 


I 
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TABLE 2 
Terms Usep In CALCULATING kg 

t r; fia fsx fi,ss fi,44 

1 1.40 .9400 2.9600 4.8600 5.4400 

2 3.05 (6725 19.8900 12.0400 22.1725 

= 7.20 83.5200 83.5200 97.2800 

4 | 9.14 101.2032 101.2082 112.0132 112.0132 

5 25.00 312.5000 312.5000 — = 
Total 45.79 505.8357 436.5532 212.4332 236.9057 

u —fijr2 —finr3 fies —fios —fis,o4 —fi,34 

1 3200 .7800 1.4400 2.1600 3.6800 6.7200 

2 5.3550 2.3700 7.2975 8.3400 20.8950 11.1300 

3 — 60.4800 99 .8400 -_— - 99 .8400 

4 42.7968 49 4832 49 4832 49.4832 49.4832 56.9868 

is 312.5000 _ — ee — at 
Total 360.9718 113.1132 158.0607 59.9832 74.0582 174.6768 


— 2[360.9718(.036902) + 113.1132(.025373) 
+ 59.9832(.025393)] 


= 77.16. 


This procedure for obtaining R, and kg may not appear greatly more 
straight-forward than inverting the matrix required in (8) which in 
this case is the 8 X 8 matrix 


I! 
2 
L 3 


10 


20 


40 


3 12 


50 


12 25 


6 O 12 25 


4 12 


te) 


45 


32] 


Advantages are apparent however, when one considers the case of a 
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large number of levels of the random classification 500 say, instead of 5. 
The matrix to be inverted would then be of order 503 while the pro- 
cedure outlined here would still only require inverting a 3 X 3. The 
calculation of its terms the c’s and of the terms for kg , the f’s, is still 
lengthy but can be accomplished separately for each 7 and summation 
made over 7. This can be arranged quite straightforwardly for a desk 
calculator and is easily organized for an electronic computer such as 
the IBM 650, for example. 
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OPTIMUM SAMPLE SIZE IN ANIMAL 
DISEASE CONTROL’ 


A. W. Norpsxoe?, H. T. Davin? anp H. B. Ersenperc! 
Iowa State University, Ames, Iowa, U.S. A. 


1. INTRODUCTION 


The objective of an animal disease control program would either 
be (a) to completely eradicate a disease or alternatively (b) to reduce 
the incidence of a disease to a low level and then to keep it in check. 
In the case of (b) the causative organism might not be completely 
stamped out. 

If a disease has not obtained a strong foothold in a country, then 
objective (a) clearly is justified and perhaps the most drastic methods 
of control are in order. If, on the otherhand, a disease is widely dis- 
tributed both geographically and zoologically, then alternative (b) 
might be more realistic. This implies that the objective of a particular 
control program should take into account the cost of the program 
relative to the price of the consequence if there were no control. 

A case in point is the method of control for Pullorum disease 
(Salmonella pullorum) in poultry. The active form of the disease 
causes an enteritis in baby chicks which may result in high death 
losses. Chicks which recover may be carriers of the disease as adults. 
Female carriers may then transmit the organism via the egg to their 
progeny, where the disease again may become epidemic. Other species 
of domestic birds, and numerous wild species including pheasants, 
quail, and even foxes, cats, swine, cattle and man, have been reported 
as potential carriers of Pullorum. 

Control of this disease is based on a blood testing technique. 
Samples of blood, collected from each member of an adult flock, are 
tested against an antigen of pullorum organisms for agglutinizing 
antibodies. Positive reactors to the blood test are judged to be disease 
carriers and therefore are removed from the flock and slaughtered. 

The blood agglutination test, carried out in most states for the past 
three or four decades, has now reduced the disease to one of relatively 


1Journal Paper No. J-4047 of the Iowa Agricultural and Home Economics Experiment Station, 


Ames, Iowa. Project No. 1326. 
2Department of Poultry Husbandry. *Department of Statistics. ‘Department of Statistics; now 


at System Development Corporation. 
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minor importance. For example, the number of reactors reported by 
participants in the Iowa Control program was less than .07 of one 
percent in 1959 while in the Massachusetts program no positive reactors 
were reported in that year. 

When a disease such as Pullorum is widely distributed but at a low 
incidence on a state-wide or national level, it becomes problematical 
whether to continue past practices of completely blood testing all birds 
in a flock, or whether blood testing only a sample, or even no birds at 
all, of each flock might be more economical. 

The present paper investigates the extent to which 100 percent blood 
testing is justified, assuming random, that is binomial, infestation of 
adult flocks. Specifically, we attempt to determine the most economical 
flock proportion to be blood tested in any given year and locale, as a 
function of certain values and costs and of the prevailing binomial 
infestation rate. 

Economic optimizations in the presence of binomial a priori distri- 
butions have been studied previously, as for example in fi2e iss 
and [4]. However, this prior work is almost entirely concerned with 
industrial inspection problems. The present paper is intended in part 
to illustrate the fact that such methodology, initially intended for 
industrial application, is equally applicable in the biological realm. 

We recognize that the assumption of binomial infestation might in 
certain instances be improved upon, by suitable contagion models for 
example. However, this assumption seems not unreasonable in the 
case of Pullorum; in this case, the adult flock is composed of birds 
which, though able to transmit the disease to their progeny through 
the egg, have low contagious influence as carriers. 


2. ALTERNATIVE FLOCK TESTING POLICIES; 
VALUES AND COSTS 


We consider the following one-parameter (the parameter is 7) 
family of flock testing policies: 


Sample and blood test birds out of a flock of N. If the 
sample contains no reactors (reaction assumed equiva- 
lent to infection), return the n birds to the flock and 
blood test no more. If the sample contains one or more 
reactors, blood test the entire flock; slaughter all dis- 
covered reactors. 


The relative economic worth of each of these N + 1 competing 
flock testing policies are now evaluated in the light of the following 
values and costs. 
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a: blood test cost per bird, 

CG: carcass value per bird, 

a(k): average value of a member of a flock of size N that 
contains k undiscovered reactors, 

b(k): value of a non-reactor from a completely tested 
flock of size N containing k discovered reactors. 


Note that a(0) = 6(0) is the value of a member of a flock con- 
taining no reactors. Again, the value of a discovered reactor from a 
completely tested flock isc. Finally, the precise shapes of the function 
a(k) will depend on the likelihood that the progeny from a mating 
involving an undiscovered reactor will become acutely infected, thereby 
precipitating an epidemic in the progeny flock. The shape of the 
function b(k) will relate largely to loss of good-will. If good-will is 
not an important factor, it will not be unreasonable to set b(k) = 6(0) 
for all k; indeed, this is the assumption made at the end of Section 5. 


3. PRELIMINARY CONSIDERATIONS 
Computationally tractable forms of a(k) and b(k) are as follows. 
a(0) = A,a(k) =a for k #0, 
b(0) = A, b(k) = B for k #0. 


(1) 


It is shown below that the most profitable n is either 0 or N. If this 
can, for the moment, be assumed, then the derivation of the most 
profitable sample size (i.e. the choice between 0 and N) becomes a 
matter of simple arithmetic, at least for the limiting cases of very small 
or very large flock size N. 

Consider a large number M of birds, grouped into m flocks of 
N = M/m birds. The case of very small flock size is typified by N = 1, 
while the case of very large flock size is typified by m = 1. 

For N = 1, the average value of the M birds (i.e. one-bird flocks) 


equals 
M1 — A+ Mra, if 790) (2a) 
M(1 — r)A + Mae — Mi, lt 97 (Cae (2b) 
so that 100 percent testing will be more (less) profitable than no testing 
according to whether 
. Pee (nia. (3) 


For m = 1, the average value of the M birds (i.e. one flock of M 
birds) equals 
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MQ — ma+ Mra = Ma, to 7 = 0, (4a) 
M1 — m6 + Mare — M,, fi n=N GD, (4b) 


so that 100 percent testing will be more (less) profitable than no testing 
according to whether 


B-—a>(<)r6—6 +7 (5) 


Expressions (2a) and (2b) arise from the fact that, under both zero 
and complete testing, a proportion (1 — 7) of the M one-bird flocks 
will be disease-free, contributing an amount M(1 — aw) A to average 
value. In addition, there will be a value contribution from the Mr 
one-bird flocks containing a reactor; this contribution will amount to 
Mre in the case of no testing and to M7c in the case of 100 percent 
testing. Finally, there is the testing cost Mz that is incurred under 
100 percent testing. 

Expressions (4a) and (4b) arise from the fact that a very large 
flock will contain an approximate proportion a of reactors; hence, 
under 100 percent testing, there will be approximately Mr birds con- 
tributing carcass value c to the flock, and approximately M(1 — 7) 
birds contributing value 6. Similarly, if WM is large enough to make 
Mr large, the flock of M birds will contain at least one reactor with 
probability essentially 1, leading to an average per-bird value of a in 
the absence of testing. 

The criteria represented by (3) and (5) constitute an almost adequate 
solution for the case of the value assumptions given in (1). The further 
computations of Section 4 will serve only to validate the assertion that 
the most profitable sample size must either be zero or N, and will lead, 
as well, to the analogues of (3) and (5) for flocks of intermediate size. 


4. THE COMPUTATION OF THE MOST PROFITABLE SAMPLE SIZE 
FOR THE CASE OF CONSTANT VALUE DIFFERENCE 0(k) — a(k) 


This is the case typified by form (1) of the functions a(k) and b(k). 
The computations proceed as follows. Let 


¢, = the binomial probability of k reactors in a flock of 
SE! = oO eaters (6) 
and let 


h,., = the hypergeometric probability of obtaining 7 non- 
reactors when drawing a random sample of size n 
from a flock of N birds containing k reactors and 
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(N — k) non-reactors = G na ") y at (7) 


n n 


Then for any particular one of the (V + 1) alternative policies pre- 
sented in Section 2, say the policy corresponding to sample size n, 


Pr {number of reactors in the flock = k; number of re- 
actors in the sample = 0} = ¢,h,.., (8) 


and the “profit’’ ensuing if the number of reactors in the flock equals 
k and the number of reactors in the sample equals 0 is 


Natk) — ni. (9) 


Hence the contribution to expected profit of the events involving no 
reactors in the sample equals the sum of the products of (8) and (9): 
N-n 
> [Natk) — ni} -dbe-hn.n - (10) 
k=0 


Again, for the same sample size n, 


Pr {numbers of reactors in the flock = k; number of re- 
actors in the sample > 0} = ¢,-(1 — h,.,), (11) 


and the “‘profit’’ ensuing if the number of reactors in the flock equals 
k; and the number of reactors in the sample exceeds 0 is 


(N — k)-b(k) + ke — Ni. (12) 


Hence the contribution to expected profit of the events involving one 
or more reactors in the sample equals the sum of the products of (11) 
and (12): 


N 
> (N — k)-b(k) + ke — Ni)-¢e-(1 — hyn): (13) 
k=0 

Total expected profit will equal the sum of (10) and (13), which, 


neglecting terms not involving n and using the fact that h,,, = 0 for 
k >N —n-+ 1, can be written 


S {N-[a(k) — b(k)] + &-[b) — ec] + (N — 1) -t} bert 14) 


Expression (14) is further reducible to 
(1 — 1)": {N-Ela(k) — b(%)] + Elk-(b(k) — o)] + (N — n)-t}, (18) 


where the expectation E[ ] is with respect to the chance variable k 
having a binomial distribution with parameters (V — n) and r. This 
type of expectation arises from the fact that 
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bilin.e = (1 — oy (® ° "e( = Peal 


Further reduction leads, except for the additive term (1 —r)*-N(6—a), 
to the expression 


(1 — x)"(T — nS) = PO), (16) 

where 
T=Nie—6 +76 —o +4, (17) 
S = 76 —c) 1-0: (18) 


Differencing P(n) now yields 
A(n) = P(n + 1) — P@) = (1 — 2)"[(mr — 18 — m(T — S)], 


which shows that A(n) is negative for n < (7/8) + (l/r) — 1, and is 
positive for n > (T/S8) + (1/r) — 1. This implies that no P(n) for 
0 <n < N can be larger than the larger of P(0) and P(N), which in 
turn implies that the most “profitable”? sample size n either is 0 or NV. 
Establishing this fact (which is reminiscent of conclusions reached in 
[1] and [2]) was one of the two objectives set for this section in the last 
paragraph of Section 3. 

The second objective set for this section was to derive a condition 
analogous to (3) and (5) for determining the relative profitabilities of 
the two sample sizes 0 and N for intermediate flock size N. But this 
now is simply a matter of comparing P(0) and P(N), where P(n) is 
given by (16). This yields the conclusion that 100 percent testing is 
more (less) profitable than no testing at all according to whether 


(6 — afl — (1 — 2") > (<) eB — 6 +1. (19) 


We note that (19) does indeed specialize to (8) and (5) for N equal, 
respectively, to 1 and to ~. 

Although details are outside the scope of this paper, it may be of 
interest to point out that condition (8) arises naturally in the computa- 
tion of the sequential Bayes test policy for the value assumptions (1). 
Consider the much (though no doubt impractically) enlarged set of 
flock testing policies consisting of all sequential plans with terminal 
acts A and R: 


A : Stop testing; collect ¢ for every reactor culled out so far; return 
non-reactors to the flock; eventually collect a or 6 per bird of 
the unculled portion of the flock, depending on whether or 
not this portion contains at least one reactor, unless the entire 
flock is reactor-free, in which case collect A. 
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R : Test 100 percent; collect ¢ for reactors, and @ for non-reactors, 
unless the entire flock is reactor-free, in which case collect A. 

A straightforward application of the methodology given in [2] then 
shows that 100 percent testing is the most economic of the policies in 
this enlarged set if r(¢ — a) > 7. However, the complementary pre- 
scription for no testing if r(¢e — a) <i Hoes not apply in this case. 


5. THE COMPUTATION OF THE MOST PROFITABLE SAMPLE SIZE 
FOR THE CASE OF LINEARLY INCREASING VALUE 
DIFFERENCE 0(k) — a(k). 


A value assumption alternative to (1) is 


atk) A-S(4- 2 Oe Ne 


(20) 
i= AeA ee 


Using (15) [which was derived without reference to any specific form 
of a(k) and b(k)], expected profit now becomes, except for additive 
constants not involving n, 


Ps (a + ns), (21) 

where 
T=(A — da —7)\(N —1)+ N[ry — oc) + 7, (22) 
25 ea yt ey (23) 


The function P(n) given by (21) is best described in terms of the follow- 
ing five parametric cases. 

Case I: S > 0, T = 0. Asn increases from — ©, P(n) rise steadily 
from — ©, crosses the n-axis at n = —T/S, equals T at n = O, turns 
downward somewhere between n = —T/S and n = N, crosses the 
n-axis once more at n = N, turns upward somewhere beyond n = N, 
and approaches the -axis asymptotically from below as n approaches 


+o. 

Case JI: S > 0,0 > 7 > —NS. As 1 increases: from — =, P(n) 
rises steadily from —, equals 7 at n = O, crosses the n-axis 
at n = —T/S, turns downward somewhere between n = —T7/S and 


n = N, crosses the n-axis once more at n = N, turns upward somewhere 
beyond n = JN, and approaches the n-axis asymptotically from below 


as n approaches + ©. 
Case IIT: S > 0, T < —NS. As n increases from —~, P(n) rises 
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steadily from —@, equals T’ at n = 0, crosses the n-axis at n = N, 
turns downward somewhere between n = N and n = —T/S, crosses 
the n-axis once more at n = —Z'/S, turns upward somewhere beyond 
n = —T/S, and approaches the n-axis asymptotically from below as n 
approaches + ©. 

Case IV S = (0; T > 0, Asm meresses trom P(n) decreases 
steadily from +, equals 7 at n = 0, crosses the n-axis at n = N, 
turns upward somewhere beyond n = N, and approaches the n-axis 
asymptotically from below as n approaches + ©. 

Case V: S = 0, T < 0. As 7 increases from —™%™, P(n) increases 
steadily from — ©, equals 7’ at n = 0, crosses the n-axis at n = N, 
turns downward somewhere beyond n = N, and approaches the n-axis 
asymptotically from above as n approaches + ©. 

Cases IV and V are easily summarized as follows: If A = 6 (corre- 
sponding, as indicated at the end of Section 2, to the absence of the 
good-will factor), then the only policies in contention are 100 percent 
testing (n = N) and no testing (n = 0), and 100 percent testing will be 
more (less) profitable than no testing according to whether 


nla yee eae (24) 


It seems of interest to note the resemblance of (24) and (8). 

Cases III and V are summarized by: If 7 + NS < 0, it is most 
economical to test 100 percent. 

For Case II, the most economical sample size is the n between 
—T/S and N at which P(n) turns downward. This case is of interest 
since it establishes the possibility of an optimum sample size other 
than 0 or N. This possibility has already been noted in [3]. 

For Case I, the most economical sample size is either n = 0 or the 
n at which P(n) turns downward, depending on the relative magnitudes 
at these two sample sizes. Note that, for T = 0, the n at which P(n) 
turns downward must be greater than zero, so that, as in Case II, the 
most economical sample size will be other than 0 or N. 


6. EXAMPLE AND CONCLUSIONS. 


Consider the case when good-will is not a factor, i.e. the case of 
constant b(k). In this case both the formulation of Section 4 and that 
of Section 5 imply that only the two policies of zero and 100 percent 
testing are in contention, the choice between these depending on the 
direction of a simple inequality. 

Defining the critical testing cost 1, to be the testing cost for which 
zero and 100 percent testing are equally economical, it seems of interest 
to compute 7, for both formulations, using comparable value figures. 
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A typical value for a non-reactor in a fully tested flock is $3, a 
typical average value of a member of an untested flock is $2.40, and 
typical values for c, 7 and N are $.50, 10°° and 500. 

Computing 7, in the spirit of Section 4, we therefore set A = 8 =$3 
and a = $2.40. Replacing 1 — (1 — 2)” by Nz (allowable since Nx 
is small), criterion (19) then becomes $3 X 107° > (<) $2.5 X 107° 4+ 7, 
which means that 7, = $0.003, ie. that no testing is most profitable 
unless the cost of testing falls below 3 mills per bird. 

Computing 7, in the spirit of Section 5, we set A = 6 =$3. In 
addition, we interpret $2.40 to be the value of the linear function a(k) 
evaluated at k = Nz, the expected number of reactors in the flock. 
This implies a per-bird disaster value of y = —$6 X 10* for an un- 
suspected 100 percent infected flock. Criterion (24) then becomes 
$0.6 > (<)z, which means that 7, = $0.60, i.e. that 100 percent testing 
is most profitable unless the cost of testing rises above 60¢ per bird. 

In practice, the cost of testing is approximately seven cents per 
bird. Since this cost is well bracketed by the critical costs 0.3¢ and 
60¢ derived above, we learn that the shape of the value function a(k) 
{and of course also that of b(k)] must be determined rather accurately 
if the methodology presented here is to be applied. 
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NUMERICAL ASPECTS OF THE REGRESSION 
OF OFFSPRING ON PARENT" 


H. E. McKezan anv B. B. Bonren 
Population Genetics Institute, Purdue University 
Lafayette, Indiana, U.S. A. 


INTRODUCTION 

In an earlier paper (Bohren, McKean, and Yamada, [1961]) three 
currently employed techniques for estimating the regression of off- 
spring on parent, and thereby heritability in the narrow sense, were 
compared and contrasted with respect to their efficiencies of estimation. 
The general conclusion, based on theoretical considerations and an 
empirical study of five generations of a closed poultry flock (Yamada, 
Bohren, and Crittenden, [1957]), was that under the circumstances 
considered, the method of regression of offspring means on parent’s 
record (method 1) was inferior to the method of regression of individual 
offspring on parent (method 2) (with the parent’s record repeated once 
for each of its offspring) and to (method 3) the Kempthorne-Tandon 
technique (Kempthorne and Tandon, [1953]). 

The success of the Kempthorne-Tandon technique depends upon 
knowledge of a parameter p, the correlation between deviations of two 
offspring of the same parent from the predicted breeding value of the 
parent, and its expected superiority over the second method depends 
upon the magnitude of p. Usually p is guessed in the light of prior 
knowledge, and weights are assigned to the families according to the 
guessed value of p. In the first paper it was shown that, under the 
assumption of all genetic variance being additive, p < .067 or < .079, 
depending upon whether the mating structure is random or hierarchal. 

The results obtained in the previous paper specifically depended 
upon the particular distribution of family sizes encountered in the five 
analyses, and upon the accuracy of the estimated values of T=p/(1—p). 
The purposes of this paper are to consider the efficiency loss incurred 
by mis-guessing p in the Kempthorne-Tandon technique, and to in- 
vestigate the factors involved in the relative efficiency of the other 
two methods. 


THEORY 


We consider a breeding experiment in which s sires are selected 


1Journal Paper Number 1688, Purdue University Agricultural Experiment Station. 
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from the population, sire 7 being mated to a random sample of d; distinct 
dams. The mating structure is then hierarchal, with the progeny of 
sire 2 having phenotypic values given by the model 

Vine = wie + BX = ph) =P eis ’ (1) 
which is equation (11) of the previous paper. 

It has been pointed out that the three methods under consideration 
are merely special cases of the general unbiased weighted regression 
estimation procedure. The difference between the methods involves 
only the weights applied to each progeny-group deviation: 1) wy = 1 
for the progeny means on parents technique, (2) w;; = n;; for the 
repeated parents technique, and (3) w,; = n,;/(1 + n,,;7) for the 
Kempthorne-Tandon technique, where 7 is a guessed value of 7. If 
7 = T, the third technique is the minimum variance technique, whereas, 
if r = 0, the third technique reduces to the second. Furthermore, 
when the family sizes are equal (n;; = nig = --- , for all 2), all three 
methods are identical in efficiency. Since most experimental data 
will involve unequal family sizes, interest will center on considering 
this situation. 

The question, ‘When may I use method 1 with little loss in effici- 
ency?’’, is a pertinent one. First, let us examine the optimum choice 
of weights for which the variance of the estimate of 6 will be minimized. 
These optimal weights w* (say), where w* = n,;/(1 + n,;T), will be 
approximately equal (hence method 1 appropriate), for 7 > 0, under 
one or more of three distinct sets of circumstances: 

1) All n;; are large. This follows immediately from 


ij | Ni; | el ; 

ves 1 + nT 7 Uh 
2) S* , the variance of the family sizes, is zero or very small. 
3) T is large. This follows from the fact that 


* 
2 tO 
A, 
Too Wir 57 
(or p> 1) 


independent of n;; and n,-,;, ; thus for large 7, w* = w*,,,. In view of 
this, method 1 may also be considered as a special case of method 3 
where 7, the guessed value of 7’, is allowed to approach ~. 

It is of interest, therefore, to determine a readily accessible statistic 
which depends upon 7% (the average dam family size), S; , and 7’, upon 
which a decision to use or not use method 1 as opposed to method 2 


may be based. . 
It is easy to show that the coefficient of variation among the optimal 
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weights w% is approximately 


oS Va) 
WZ) ~ AL + AT) 1+nT’ 


where C.V.(n) = S,/n = coefficient of variation of the dam family 
sizes. Since ¥(T) is a function of all three of the statistics mentioned 
above, it is of interest to consider the relationship between relative 
efiiciency (Min V(8)/V(@)) of the various techniques and y(7'’) [even 
though ¥(7’) may seriously differ from the true coefficient of variation 
of the w ]. 

For any choice of r, we may define 


Wa) = CVG a): (2) 


Clearly, (0) = C.V.(n), ¥(e) = 0, and ¥(r) is a monotonically de- 
creasing function of 7. Since the range 0 < 7 < o corresponds one- 
to-one to the range in y of C.V.(n) > ¥ = O, it is more convenient to 
express graphically a relationship between y, T, and relative efficiency 
rather than 7, 7, and relative efficiency. 

Given a fixed set of family sizes m1, , M12 , ‘-* the relative efficiency 
is a function of 7, the true unknown parameter of the population, and 
r, the guessed value. Because of the one-to-one relationship between + 
and (7), the relative efficiency may be expressed as a function of 7 
and y. The explicit form of this function may be obtained by solving 
equation (2) for 7 and substituting the resulting expression in wy into the 
equation for relative efficiency: 


RO, 2) = D2 /oid/D W/o), 


where 8; = estimated regression within the 7th sire using 7 as the guessed 
value of T, 

8* = estimated regression within the 7th sire using 7 = 7’ (optimum), 
and 


F 1 nigh es eee 
a 1 — +a 2 ( = auc 
of, = a ( p) rE ir (1 + n,;7) Dy Wa Xk) 


og; = 6 (1 = p)/ Dy Wa Kay a) 


which correspond to equations (15) and (16) of the previous paper. 
The graph of a typical relationship between y, 7’, and relative 

efficiency is given in Figure 1. The z- and y-coordinates of any point 

on the surface pictured represent respectively, the coefficient of varia- 
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FIGURE 1. 


GRAPHICAL REPRESENTATION OF A TYPICAL RELATIONSHIP BETWEEN « = y, y = T, 


AND 2 = RELATIVE EFFICIENCY. 


tion of the weights resulting from a choice of 7, and the true value of 
the parameter 7, whereas the z-coordinate is the resultant relative 
efficiency. Several interesting properties should be noted: 


1) 


2) 
3) 


4 


a 


5) 


6) 


The points in the (a, y)-plane corresponding to r = TJ’ (optimal 
efficiency) lie on a curve whose equation is « = y¥(T). The 
surface representing relative efficiency thus has a ridge which 
coincides with the projection of this curve onto the plane z = 1. 
The relative efficiency falls off from this ridge in all directions. 
The relative efficiency of method 1(r = ©) is a strictly in- 
creasing function of 7’. 

The relative efficiency of method 2(7 = 0) is a strictly decreasing 
function of 7. 

The relative efficiency of method 3(0 < +r < @) is strictly in- 
creasing for 0 < 7 < 7 and strictly decreasing for7 < T < o. 
For any fixed value 7, relative efficiency is a strictly increasing 
function of 7 for 0 < 7 < T and strictly decreasing for 7 > T. 
This family of curves, generated by all possible 7’ values, ex- 
presses graphically the consequences of mis-guessing 7’. 
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7) If C.V.(n) is small, the whole picture is condensed along the 
z-axis so that mis-guessing the value of 7’ will not result in 
serious loss in efficiency. If C.V.(n) = 0, the surface is degenerate, 
and reduces to the line (x = 0, 2 = 1). Alternatively large 
values of C’.V.(n) may result in serious efficiency loss if |7 -—T | 
is large. 


AN ILLUSTRATION 


In order to consider the effect of a poor guess for p, we will consider 
the poultry population previously studied by Bohren, McKean and 
Yamada [1961]. Five years data, 1952-1956 inclusive, are included 
in the analysis. The information pertinent to the population size and 
structure are given in Table 1 along with the average family sizes and 
the coefficients of variation of n. The relative efficiencies of the three 
methods for the year 1952 are shown in Table 2 for a range of values 


TABLE 1 

Tur Famity STRUCTURE OF THE POPULATIONS STUDIED. 

No. No. No. Average C.V. (Family 
Year Sires Dams Pullets Family Size Size) 
1952 10 106 605 eel .74 
1953 10 92 877 9.5 43 
1954 ial 78 573 ee 41 
1955 18 108 852 7.9 AT 
1956 20 132 als 5.4 54 


of J and 7 between zero and .07. This covers most of the theoretical 
range of p, under the assumption of additive genetic variance, as found 
by Bohren, McKean, and Yamada [1961] and includes the estimates of 
T obtained from these data. It is seen that in no case is the efficiency 
of either the repeated parent technique (r = 0) or the Kempthorne- 
Tandon technique less than .97. If an intermediate value of 7 is chosen 
under the Kempthorne-Tandon technique (say .03) it is seen that the 
efficiency does not decline below .99. The efficiency of the means 
technique is seen to be consistently poorer than the other two techniques. 

Analogous tables were generated for the years 1953-1956. Es- 
sentially identical results were obtained for all five years analyzed. 
In all years the relative efficiency of the Kempthorne-Tandon technique 
remains above 97 percent for 7 < .07 and 7 < .07. 

Method 1 was consistently less efficient than method 2 in all years 
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TABLE 2 
RELATIVE EFFICIENCIES OF THE THREE Metuops oF EstiMatTING 8 
RELATIVE TO THE MinruuM VARIANCE ESTIMATOR OF (XCM = =) 
FOR VARIOUS SMALL VALUES oF 7’ FoR 1952 Data. 


Method 1 Values of 7 used in the KT technique (Method 3) 
(Means 

i Tech. ) 002 01 .02 .03 04 05 .06 .07 

.00 .5836 1.0000 .9988 .9958 .9919 .9872 .9822 .9770 .9716 
01 .6083 9988 1.0000 .9991 .9969 .9938  .9901 .9861 .9818 
.02 .6302 9958 .9991 1.0000 .9993 .9976 .9951 .9922 .9888 
.03 .6499 9917 .9969 .9993 1.0000 .9995 .9981 .9961 .9936 
.04 .6678 9870 .9937 .9976 .9995 1.0000 .9996 .9985 .9968 
05 -6840 9819 .9901 .9951 .9981 .9996 1.0000 .9997 .9987 
.06 .6989 9766 .9860 .9922 .9961 .9984  .9996 1.0000 .9997 
07 .7126 9713 .9818 .9889 .9937 .9968  .9987 .9998 1.0000 


27 = 0 is Method 2 


over the range 0 < T < .07, but was relatively more efficient than in 
1952. Letting H denote the efficiency of method 1 (means), then, as 7 
rises from zero to .07, in: 


1952, E rises from .5836 to .7126, 
1953, E rises from .9142 to .9615, 
1954, # rises from .9046 to .9587, 
1955, E rises from .9370 to .9682, and 
1956, # rises from .8058 to .8703. 


Table 3 gives a direct comparison of the relative efficiencies of 
methods 1 and 2 when large values of T are involved, which may, but 
do not necessarily, arise when maternal effects, non-additive genetic 
effects, and/or environmental correlations are present. It is interesting 
to note that the T-value at which the two methods are equally efficient 
varies considerably due to differences in the distribution of dam family 
sizes, varying from approximately 7 = .09 in 1953 to 7 = .82 in 1952. 
The corresponding values of ¥(7’), the coefficients of variation of the 
optimum weights w* , show much more year-to-year stability, how- 
ever, ranging from a low of .23 in 1953 to a high of .27 in 1952. 

Let 7, represent the value of 7 for which the two methods are 
equally efficient. The values ¥(7,) will not ordinarily be limited to as 
small a range as observed here (see Table 4). It is quite possible to 
have much lower values of C.V.(n), which is an upper bound for ¥(7,). 
Hence one cannot claim any invariance properties for ¥(7)). On the 
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TABLE 3 
RELATIV® EFFICIENCIES OF METHODS 1 AND 2 POR LARGE VALUES OF T. 
Year Method 10 .20 .30 .40 .50 
1952 1(Means) .748 .825 .870 .900 .920 
2s 0) .956 .912 .880 .856 .838 
1953 1(Means) .970 .984 .990 .993 .995 
27 = 0) .963 .932 .912 .898 .889 
1954 1(Means) .968 .984 .990 .993 .995 
27 = 0) 971 947 .932 .921 .914 
1955 1(Means) 974 .985 .990 .993 .995 
27 = 0) .974 .952 .937 .926 .918 
1956 1(Means) 887 .924 .945 .958 .966 
20) .975 .945 .920 .901 .885 
TABLE 4 
ILLUSTRATION OF THE Utiuity or (7) In CHoosINa 
BrerwEen Metuops | AnD 2. 

Estimated Standard Error 
Year ie WT) WT) Method 1 Method 2 

1952 .059 5) SPA PAL .102 

1953 .029 BY! 120 .085 .083 

1954 .051 sol) 24 m2 .148 

1955 .055 83 .26 .086 .084 

1956 .034 46 24 .108 .094 


basis of information available the recommended procedure for selecting 
an estimation technique is as follows: 


AS 


B: 


From previous experience, or by any other reasoning, assume a 
value 7 for T. 

If possible, use the Kempthorne-Tandon technique, constructing 
weights using this assumed value r. 


. If it is neither possible nor convenient to use method 3, compute 


¥(r). For large values of (7) (say > .4) preference is for method 
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2 whereas for small values of ¥(r) (say < .1) method 1 is preferred. 
For intermediate values of ¥(r) there should be no substantial 
differences between methods one and two. 


To illustrate the use of this approach, consider Table 4. Here, 
estimates of 7’, obtained by Bohren et al. [1961], for each of the years 
1952-1956, are used to calculate ¥(T). For comparison, ¥(7'y) is also 
entered, together with the estimated standard errors of 8. The values 
¥(T) indicate a preference for method 2 in each case, but 7) is obtained 
only after tedious calculation. The rule of thumb suggested would 
lead to strong preference for method 2 in 1952 and 1956 and no preference 
in the other years. 

The results of these investigations indicate that in many circum- 
stances a poor choice of technique can result in serious loss in efficiency 
in the estimation of 8, and thus of heritability in the narrow sense 
(h--= 28); 

It would seem that, once the expense of experimentation and data 
collection has been incurred, the investigator is under an obligation to 
obtain highly efficient estimates. However, in some instances the loss 
in efficiency is negligible if one of the two standard techniques is used. 
The use of ¥(r) to aid in the choice between the standard methods 
seems promising. 
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SOME HYPOTHESES CONCERNING TWO 
PHASE REGRESSION LINES 


P. SPRENT 


East Malling Research Station, 
Maidstone, Kent, England. 


1. INTRODUCTION 


The regression of a growth measurement y on time x can often be 
reasonably represented by two intersecting straight lines, one being 
appropriate when x takes values below and the other when z takes 
values above a certain fixed but often unknown value corresponding 
to the intersection. Such regressions are here called two-phase re- 
gressions, the intersection of the phases being referred to as the change- 
over point, and the value of x at which it occurs being called the change- 
over value. 

Situations in which such regressions might occur include the onset 
of a disease resulting in a reduced growth rate; the application of a 
treatment having an immediate stimulating or inhibiting effect; the 
occurrence of an extremely hot or cold day or some other change in 
external conditions; physical injury of an organism. 

In a study of the compatibility of peach scions on plum rootstocks 
Garner and Hammond [1938] noted that the peach variety Hale’s 
Early developed at constant but different rates on compatible and 
incompatible rootstock-scion unions up to a certain date. After that 
date the growth rate in the compatible case continued at a new constant 
rate, whilst in the incompatible case all growth then ceased. 

Thus for a compatible union there was a typical two-phase regression, 
whilst for the incompatible union a rather special case occurred in 
which the slope of the second phase was zero. A further example is 
given in Section 4 in which the date of phase change in relation to 
time elapsed after application of treatments is of interest. 

If x and y are growth measurements on two different parts of the 
same organism, and Huxley’s allometric growth law operates (i.e. there 
is a linear relation between log x and log y) it has sometimes been 
found that sudden changes in slope occur. Reeve and Huxley [1945] 
have discussed this situation with reference to changes in growth 
equilibrium in crustacea at sexual maturity. Skellam et al. [1959] also 
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refer to such phase changes in allometric growth, mentioning earlier 
work of Teissier on this subject. 

A biologist will often postulate a two-phase linear regression rather 
than some alternative such as a parabolic one on largely intuitive 
grounds, and his decision on this point must be to some extent a matter 
of experience and common sense, as is generally the case in selecting 
appropriate hypotheses and models for statistical examination. In 
many cases a two-phase regression can only be a reasonable approxi- 
mation, adequate for many purposes, but by no means a complete 
description of what is taking place. 

It is hkely that two-phase regressions also arise in economic and in 
industrial production problems. 


2. HYPOTHESES 


In the present context questions of the following type are likely to 
interest the experimenter. 

A. Necessity for, and adequacy of, a two-phase fit. 

B. Investigation of change-over points and values for individual 
regressions. 

C. Relationships between lines within each phase for several re- 
gressions. 

D. Relative positions of change-over points for several regressions. 
For example, interest may attach to one or more of the following 
hypotheses. 

In relation to A: 

AI. One phase adequate (i.e. a conventional linear regression 
suffices), 

AIT. A two-phase linear regression is adequate; 

In relation to B: 

BI. The observed change-over value is consistent with a specified 

. theoretical value x = 7; 

In relation to C: 
CI. The lines within a phase are all parallel to one another, 
CII. The lines within a phase all coincide; 

In relation to D: 

DI. The change-over points are collinear, all lying on a specified 
line y = a + 62, (or x = 7 if @ infinite), 

DIT. As for DI except that a (or y) not specified, 

DITII. The change-over points are collinear on an unspecified line. 

In practice, anyone accepting AJ would proceed no further in this 
contest. If AZ were rejected and AJI accepted, a test of BI might be 
required, and with more than one set of data an experimenter may wish 
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to proceed to some of the C or D hypotheses. For instance, he might 
test and accept CJ, reject CIT as the result of a test (or without test if he 
considered it incompatible with his data) and then test and accept 
DII for the case of infinite slope. If 2 were a time measure, acceptance of 
DII with 8 infinite would imply that all phase changes occured at the 
same time, and is thus likely to be of practical interest. If one wants 
to know if a phase change can be associated with an event occurring 
at a particular x value, such as the application of a treatment, or the 
occurrence of a very hot day, a test of BI is required. 

The above hypotheses are by no means exhaustive. Concurrence 
within a phase, for instance, may be of more interest than parallelism. 

In this paper two assumptions that will be made are (i) the classical 
regression assumption that within each phase the observed y are normally 
distributed with mean zero and standard deviation o about the true 
regression line, and (ii) that it is known between which two observed x 
the change-over occurs, but not precisely where between these two 
values it occurs. 

This latter assumption calls for some comment. As will be seen, 
it leads to certain simple tests, and in many practical cases it is not 
unreasonable. Sometimes observations can only be taken at widely 
spaced 2-values, and it may be clear that the change-over has occurred 
between two such observations, without a more precise statement 
being immediately possible. 

A difficulty arises if lines are fitted to each phase making the as- 
sumption (ii), and the intersection of the fitted lines has an x co-ordinate 
falling outside the assumed range. The experimenter must then decide 
whether to attribute this to sampling errors, or an incorrect assumption, 
and determine the appropriate procedure accordingly. This matter is 
not pursued further here. 

If the second assumption cannot be made because of doubt about 
the allocation of a small proportion of points near the change-over, 
the author conjectures that the methods of Quandt [1958] could be used 
to allocate these doubtful points without seriously upsetting the pro- 
cedures suggested here. Quandt deals with the allocation problem for 
a two-regime y-on-x regression where the x occur in a known order 
with respect to a time measure ¢, and the phase change occurs when a 
certain t-value is reached. In the present case x = t. 

This paper deals mainly with questions classified under B and D, 
but we first briefly discuss hypotheses involving A and C. 

If one has an independent estimate of error variance, the test of AJ 
is simple and standard, being a test for significance of deviations from 
regression. If deviations from regression provide the only estimate of 
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error variance and indicate significance of linear regression, some judge- 
ment is needed to decide whether a two-phase regression (or any other 
fit such as a quadratic regression for example) is worth considering. 
The test of A/T is likewise essentially a test of the adequacy of two 
linear regressions, one in each phase. 

If AJJ is accepted for more than one set of data and tests for 
parallelism or coincidence (CZ, CI) are required, these may be per- 
formed by standard methods for each phase separately. 

We now consider other tests. 


3. TESTS 


Throughout summations over r run from 1 to 2 (corresponding 
to the two phases); those involving s run from 1 to n (corresponding 
to n sets of two-phase regressions); those involving k run from 1 to 
m,, for the appropriate set and phase, m,, being the number of observa- 
tions in the rth phase of the sth set. Where it is obvious how many 
equations (usually 2, n or 2n) of a given form exist, only typical ones 
are given. 

The first test considered is that of BIJ, that is compatibility with a 
fixed change-over value, assuming A//. The test is based on the 
comparison of the fit by standard regression techniques of uncon- 
strained lines in each phase, with that obtained if the lines are con- 
strained to intersect at a point whose « co-ordinate y is given. 

Appropriate models for the first and second phases respectively are 
E(_y) = a + 6.x and E(y) = a, + 6.2, where the constraint to give an 
intersection at = y 1S ag — a + y(B2 — B,) = 0. 

In view of the regression assumptions the method of least squares 
is appropriate, and a, , a2 , b, , b2 , the estimators of a, , a, , Bi , Bs, 
are to be determined so as to minimize 


Ds a (Yr — A, — 0,24)" 


subject to 
de — a + ¥(b, — b,) = 0. (1) 


Although not essential, introduction of a Lagrange multiplier \ 
preserves a certain symmetry, and gives rise to many expressions 
similar to familiar regression results, together with adjustment terms. 
With such a multiplier the function to be minimized becomes 


L, = >> dX a — a, — bt)? + 2a. — a + ¥(b2 — 5,)], 


whence, computing the partial derivatives, normal equations are ob- 
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tained which can easily be reduced to the form 


3. (yt Oe OA ae (2,3) 
~ 


a, = Yr. — Dy. = (—1)"'A/m, ) (4,5) 


where y,. = dor Yrr/mM, , etc., m, being the number of observations in 
the rth phase. The remaining normal equation is (1). Subtracting (4) 
from (5) and using (1) gives 


A= wld + da. —y) = bee. = yi, 


where w = mm2/(m, + ms) = m,m,/m, say, and d = Yo, — Mn. - 

Substituting for a, , d2 , A as given by (4) to (6) in (2) and (3) gives, 
upon writing Cz,y, , Cx? , etc., for the ordinary corrected sums of 
products and squares, 


€.¢.Ci.= do (Hin "4 i-)) pag ebeur 


(C2? + war. — y)7Jbi1 — wai. — 7) (te. — 7) be 

= Cry, —wda. — ¥), 
— wt, — ¥) (22. = ¥)b, + [Cz =5 W(xo. ory 7)" Jbe 

= Cry. + w d(a2. — ¥)- 


Having solved these equations for b; and b, it is easy to obtain 
values for a, , 2 , \ if required from (4), (5) and (6). This constrained 
fit is then compared with the unconstrained fit for which of course, 
the slopes b’ , (r = 1, 2) are given by bf = Cz,y,/Cx; . 

If there are m = m, + m, observations, the sum of squares for 
departures from unconstrained regressions has m — 4 degrees of freedom, 
two regression coefficients and two means having been estimated. For 
the constrained fit, the corresponding sum of squares has m — 3 degrees 
of freedom since only three independent parameters have been esti- 
mated in view of (1). It is easily shown that the difference, with one 
degrees of freedom, has a sum of squares given by 


> (bf — b,)Cz,-y, + dd. (7) 
This sum of squares may also be obtained by subtracting 
2 >> b,Cx,y, — >, b?Cx? — ?/w 


from the unconstrained regression sum of squares. It is worth noting 
that the constrained lines will no longer in general pass through their re- 
spective centres of mass. 


TWO PHASE REGRESSION LINES 639 


An F-test of (7) against an appropriate error variance is required. 
A significant value indicates a worthwhile improvement in fit if the 
constraint is removed, and in that event BI would normally be re- 
jected for the y under consideration. 

The above test may be extended if there are n sets of data, and 
given change-over values y, , --: , Y, postulated, each one associated 
with a given set. The regression coefficients may be estimated separately 
for each set by the procedure above, using the appropriate constraint 
each time. If desired, a joint test of the fit compared to unconstrained 
fits for all sets within each phase may be made. The sum of squares 
(7) is computed for each set, and these are added together to give a sum 
of squares with n degrees of freedom which can be tested against an 
appropriate error variance. If y, = --- = y, it will be noted that this 
test is essentially a test of DI with 8 infinite. 

Turning now to DIT there is little difficulty in developing a test 
if the change-overs are assumed to lie on a line of infinite slope (i.e. 
to occur at the same unknown z-value) providing AJJ and CT are 
accepted (i.e. all lines within each phase are parallel to one another). 
It will be assumed that all lines in the first phase have the same slope 
b, , and all lines in the second phase have the same slope bz , (bi¥ bz). 
A comparison has to be made between a fit constrained so that all 
change-over points lie on a line z = 7’, and a fit not so constrained. 
(Note y’ is not specified, and must be estimated.) 

The model now is 


E(y) =a, + 6:2 and Ey) = as, + Bot 


for the respective phases, where s = 1, --- , m, and there are n con- 
straints which may be written 
Os) ag (b> — fy) — 0. (8) 


As B; , 82 and y’ are independent of s, it is convenient to replace (8) 
by the 7 constraints 


Qe, — 15 a Mis 


the geometrical significance of which, in relation to intercepts, is obvious. 
The function to be minimized in order to estimate these parameters 
(including y in this case) is 


vey = Se Dy S Ore rae As << b.2ren) ate 2 eS NilG@e a Ais =— “, 
t 8 k 8 


the \, being Lagrange multipliers. Differentiation gives rise to a set 
of normal equations, e.g. 
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De >> OLA EEE: — AysUrsk — bare) = 0 
8 k 
des = Yrss Dae ae (—1)"A,/m,s 5) 
Dads 
Qo, — 1, —c= 0. (9) 


These lead, after some straight-forward but tedious algebra to the 
equations 


Ca w¥2.. re, Cate At Geiti.. ie Disp Beie dhe (10) 
[07 Cont bp — Cyn Ce ee (11) 
—(,01.t0.0, + [>> Cxz, + Cutz le = >) Casa. + Catena (12) 


where Cx”, , Cz,,y,. are the usual corrected sums of squares and pro- 
ducts for the sth line of the rth phase, and 


i, = DO Dee 

ie = oes 
We = My, Moe) (Mig — Ma,) 

On =) Dope Ose 
Cla. ty = >> WeL1s.0os. — los WDetrea ie 
Citi = os W,.41,;. ds — ees w, d,) 

= Bo k= Ute ay Cue 


In the above all summations are over s. 

It is easiest to solve (11) and (12) for b,; and b, , then obtain ¢ from 
(10). For testing purposes it is convenient to know the , , which can be 
obtained from the easily deducible relations 


Ns = wld, — bets. FO Oo) GES) 


Putting m = SE Do m,, , the total number of observations, then 
for unconstrained parallel lines the sum of squares for deviations from 
regression has m — 2n — 2 degrees of freedom. When the constraints 
(8) apply we fit only » + 3 independent parameters and thus have 
m — n — 3 degrees of freedom for deviations from regression. The 
difference between these two deviation sums of squares has n — 1 
degrees of freedom, and is obtained as 


> uy, (by o b,)CLpeYrs “ie Ss NO. ) (14) 
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where b; , bf are the regression coefficients for unconstrained parallel 
lines. 

Equations (11) and (12) simplify somewhat if the 2’s are the same 
for ali sets of data. 


4. EXAMPLE 


R. M. Fulford (personal communication) has made counts of the 
number of nodes in terminal buds of spurs of the apple variety Miller’s 
seedling from the first or outermost budscale to the youngest. pri- 
mordium at intervals after the application of three defoliation treat- 
ments. The treatments were applied on different dates, and the counts 
were made at intervals of seven or more days throughout the 1957 
growing season. On each counting date at least six buds from trees 
that had received a given treatment were examined. The counting 
dates were not always the same for all treatments, but were generally 
close together and covered a comparable time span. 

The regression of y, the number of nodes, upon x, the number of 
days after May 24th, was found to be consistent with the hypothesis 
of three sets of two-phase parallel regressions (i.e. AJZ and CI were 
satisfied). This was tested by standard methods which are not re- 
produced here. It appeared from the data that the change-overs might 
well have occurred on the same calendar date, rather than at, say, the 
same fixed number of days after the application of each treatment. 
In each phase of each regression there were observations for between 
six and eleven distinct counting dates z. 

The raw data are not reproduced here; but relevant means, sums 
of squares and products, weights, etc., are given. 

The content of Table 1 was computed directly from the data, and 
that of Table 2 from that of Table 1. 

The coefficients in equations (10) to (12) are easily got from these 
tables, and give 


c = 8-260 — 97-856b, + 39-578), , 
56926-21b, — 777-54b, = 11388-08, 
—777-54b, + 19057-96b, = 954-93, 
whence 0, = 0.2008, b. = 0.0583, ¢ = 10.502. From (13) we then get 
A, = —4:404, Ae = —5-6ll, Az = 10-017. 


The A, , apart from rounding errors, sum to Zero, providing a useful 
check. Computing (14) we find it comes to 7.32. This sum of squares 
has 2 degrees of freedom. An appropriate error mean square is provided 


& LO Vis AY SO vo LLL S65Gc5 18° €L11 dS 10 Gg pejoes109 pozYysTo 
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A 18 P7SSEs TG 099TS LE SZ99GS 8h 967FE9 LE" S6LE01 WOT}OOIIOD PozYs1o \\ 
= 
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by that between counts on the same date within treatments and has 
the value 1.08 with 230 degrees of freedom. This routine computation 
is not shown. The appropriate F is thus 3.66/1.08 = 3.39, and this is 
significant at the 0.05 probability level. Thus there is evidence for 
rejecting the hypothesis of a common change-over date, although it 
was clear from the data that the change-overs occurred nearer to a 
common calendar date than they did to a constant time lapse after 
application of each treatment. In the computations, particularly in 
calculating (14), care is needed to avoid excessive rounding errors. 


5. RELATION TO FIELLER’S THEOREM 


It is evident that a test of consistency of the observed change-over 
value with a specified x = y, (hypothesis BJ) is equivalent to a test 
about the intersection of two regression lines, and the test developed 
in Section 3 might be expected to be related to Fieller’s [1940, 1944] 
theorem on fiducial limits of a ratio. Fieller has shown that the fiducial 
limits for 6 = y/zx, where the errors in y and z are normally distributed 
with zero mean, are given by the roots of the quadratic in 0 


(y? — tv,,) — 20(zy — fr.) + O(@’ — De) =O. 


where v,, , Vez , Vzy are estimates of the variances and covariance of y 
and x, each based on n degrees of freedom, and ¢ is the appropriate 
value of Student’s t for the required fiducial probability with n degrees 
of freedom. 
Applying the theorem in the situation envisaged in BI, Fieller’s 
y = a, — af, andhis = b{ — bj, in terms of the parameters of un- 
constrained regression lines. The estimated variances and covariances 
of these, using an appropriate error mean square s’ say, are easily 
obtained. On the criterion of Section 3 for testing BI, a chosen y will 
just be significant if 
> (bf — b,)Cz,y, + rd — ts’ = 0. (15) 


From Fieller’s theorem it follows that such a y is given as a root of 
the quadratic in y, 


[y(bt — BS) — (af — at)?’ — bs 
[i4 ear ea] -o. (16) 


The left-hand side of (15) can be shown to be a quadratic in 7, 
and thus (15) and (16) will be equivalent if it can be shown that their 
left-hand sides are identical for three distinct values of y. This is 
easily done when y = 1, OF %2, OF (a, — af)/(bi — 52). 
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Thus using Fieller’s theorem to establish fiducial limits for y pro- 
vides a set of y’s (i.e. those between the limits) which would be accep- 
table under the hypothesis test put forward here. 

With either Fieller’s theorem, or the approach used here, difficulties 
arise if y is infinite, but in practice this causes no trouble in the present 
context as an infinite y would imply the identity of the two phases, 
i.e. acceptance of AJ, or alternatively a pair of parallel regressions, 
rather than the abrupt change of slope implied by y finite. 

Whilst Fieller’s theorem does not link directly with the test already 
considered for a common unspecified change-over date for all of n sets 
of data, it does provide a solution to a closely related problem, namely: 

Assuming there is a common-change over date, what are the p 
percent fiducial limits for this? Equations (10) to (12) provide 
estimates c, b, , bs , of y, 6: , Bz respectively. The y’ of (8) is related 
to y by the expression 


y= (8, 283)’, 
thus y’ may be estimated as 
c’ = c/(b, — bp). (17) 


If the value of c given by equation (10) is put in (17) and the variances 
and covariance of numerator and denominator obtained in the usual 
way, Fieller’s theorem may be applied. 


6. DISCUSSION 


An important characteristic of the tests derived in Section 3 is that 
they involve linear constraints upon the parameters. This would not 
be the case for instance if one wished to test DIZ with 8 infinite but 
without the restriction to parallelism introduced in Section 3. Then 
the n constraints would become 


Qos a Ais a c(by, pas Dee 


all terms being unknown. Further, if one wished to test DIZ with 6 


finite, and no restriction to parallelism, the constraints would take the 
form 


Orde 7 bo s@1. = a(b;, es bss) = B(d2, ayo Gays 


These do not easily reduce to a workable system of linear constraints 
and we are consequently led to non-linear normal equations. It is 
seldom that such equations can be easily reduced to a linear set, and 
iterative solutions or other special methods are required. Similar 


difficulties arise with many hypotheses associated with problems of 
type D. 
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An alternative approach to questions of linearity of the change-over 
points might be via the fitting of lines to the observed unconstrained 
change-over points. As these will almost certainly be determined with 
different accuracies for different sets of data depending upon the numbers 
of points, and their positionings, in each phase, some weighting process is 
indicated, and the approach is entirely different from that adopted here. 

Finally, from the biological viewpoint it must be emphasised that 
the ability to use techniques of the type suggested here in no way 
reduces the need for the experimenter to consider the biological reason- 
ableness of the model he uses, and whether it is more than a convenient 
approximation. 
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165 NOTE: A Practical Application of a 
Theoretically Inefficient Design’ 


C. P. Cox? 


National Institute for Research in Dairying 
University of Reading 
Reading, England 


1. INTRODUCTION 


In experiments on milking machine techniques for reducing the 
time taken to milk each cow, it was desired to compare milking times 
given by combinations of two pulsation ratios, P, and P, with two levels, 
V, and V, , of vacuum in the machine line. It was expected (see Sec- 
tion 3) that the animal to animal variability would differ substantially 
for the different treatments, in view of which, and the short time avail- 
able for the experiment, it was decided to limit the experiment by 
having only two treatments per animal. The main restriction de- 
termining the design arose because the treatment combination PVG 
was of such special interest that it was desired to obtain an estimate 
of the corresponding milking time from all the available animals. The 
simple solution used nicely illustrates the dual features of estimation 
and discrimination which often interplay in biological operational 
research. 


2. DESIGN PRINCIPLES 


Each animal formed an incomplete block receiving two of the four 
possible treatment combinations with four successive observations on 
each. The special feature is that, to provide the estimate required, 
all animals received the combination P,V, and practical considerations 
made it necessary for this to be the treatment for each animal in the 
first period. In theory therefore, the treatment differences would be 
confounded with any period differences but, in this case, previous 
experience had shown that these could safely be discounted in such a 
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short experiment. It was logical therefore to allot the three remaining 
treatment combinations equally, and randomly, to the animals in the 
second period. Thus, if P,V, is regarded simply as a control, the design 
itself is that properly deprecated by Yates [1936] for cases when treat- 
ment differences only are required, because of its inefficiency relative 
to the arrangement using all the six possible pairs. This latter design 
was inadmissible here because of the primary estimation requirement 
and it will appear that, because every observed difference contributes 
to each, the main effects of treatments and the interaction are estimated 
with sufficient accuracy to give discrimination at a useful practical level. 


3. ANALYSIS 


Let d;; be the difference between the two period means for the jth 
animal on the treatment arrangement distinguished by the suffix 7, as 
follows; 


period 1 2 

Bi il PiVy — P3Vi 

ae, PiV, — PiVe 

—Fs PiV, — Po: 
and j = 1, --- , n for each 7. 


Let d;. be the mean of d;; over the n animals receiving the 7th 
arrangement. Then the treatment comparisons are estimated as: 


d,./2 d2./2 d;./2 
main effect of P = 34(P2 — Pi)(Vi + V2) —1 1 =i 
main effect of V = 3(P: + P2)(V2 — V1) 1 —1 —1 
interaction PV = 4(P2 — Pi)(V2 — V1) 1 1 —1 


Hence, if o; is the variance of an individual d;;, each effect and the 
interaction is estimated with variance (oj + a3 + 03)/4n. 

With 7 animals in each of the three 7-groups separate analyses of 
the d;; are obtained for each group according to the simple partition: 


treatment difference 1 
between animals (n — 1) 


to which a within animals term may be added if required. 
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In the example, the mean differences d;. together with the between- 
animal mean squares with n = 8 were: 


treatment difference ake. mean square (7 df.) 
PV, — Poi —0.244 0.1467 
PiVi — PiVe —0.638 1 5225 
PV. — PoV2 —1.631 1.9339 


We thus obtain the main effects and interaction, in minutes, as 
P = 3(0.244 — 0.638 + 1.631) = 0.619, 

and similarly, 
V = 1.012, PV = 0.375. 


The appreciable differences between the above three mean squares 
are noteworthy and were expected because milking times are functions 
both of the machine variables and their interactions with highly in- 
dividual characteristics such as the amounts of milk secreted, neuro- 
hormonal responses and anatomical details of teats and udders. 

The variance of each main effect and of interaction PV is calculated 
as 

(0.1467 + 1.5225 + 1.9339) /32 
giving a standard error of 0.336 minutes. 

Since each mean square has the same number of degrees of freedom, 
the ordinary f-value for seven d.f. can be used for an approximate test 


of significance (after Cochran and Cox, [1957]) showing here that only 
the V-effect is significant at the five percent level. 


4. DISCRIMINATION VERSUS ESTIMATION 


It has been pointed out that the estimation requirement is fulfilled 
at the expense of the accuracy achieved for the treatment comparisons. 
Had a design employing all pairs of treatment combinations been 
applicable there would have been three more treatment groups, 


DO = 4 Ee Va .< 12. Ws ) 
Is: eevee ae Ve 
ie Panel eed 


D 


I 


a 


The main effect of P, for example, calculated without recovery of 
inter-animal information, is then, 
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Fig h(a, SP CORSE Sects 


and, instead of n, there will now be n/2 animals in each group. 

The variance heterogeneity in the particular example complicates 
the direct comparison but, if in a general case, transforming if necessary, 
we assume that each d;; has variance o°, the variance of a treatment 
comparison is now 


HS 
bo 
Si 


II 
9 
iw) 
> 
bo 
= 


16 n 


which compares with 30°/4n obtained when only three differences are 
directly investigated. 

Both designs provide unbiassed estimates of the milking time on 
P,V, and the above loss of discrimination is compensated by the fact 
that this mean milking time is estimated with only half the variance 
which would have obtained if all the paired differences had been used. 

Finally, with reference to the particular example, when not only 
the mean but the distribution of milking times on the P,V, combination 
is of interest, the larger numbers of observations obtained with the 
design used will be additionally advantageous. 
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166 QUERY: On a Graphical Sequential Test 


Bross [1952] describes a graphical sequential test, and among the 
plotting rules is the instruction that “aif new and old treatments lead 
to the same outcome then no information about superiority is obtained 
and nothing is plotted on the chart’. From his chart therefore dis- 
tinguishable results are considered significant whether they are 6 out 
of 6, or 6 out of 100 in the other 94 of which both treatments gave the 
same result. If one treatment kills 100 patients, and the other kills 
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94 and cures 6, normal probability ideas seem to suggest that there is 
only a 6 percent chance that there is some difference between the 
treatments, and not that it is less than 5 percent that there is no differ- 
ence. Is it wise to ignore some of the experimental results? Should 
the instruction be reworded “‘if the fact that the outcome of the new 
and old treatments is the same provides no information towards the 
particular problem being investigated, nothing is plotted on the chart: 
if it supports a hypothesis that the treatments are not different, then 
the experiment is plotted in the square adjoining the last plot diagonally 
away from the origin’’? 


REFERENCE 
Bross, I. D. J., [1952]. Sequential medical plans, Biometrics 8, 188. 


ANSWER: 


Although the query concerns a sequential procedure it applies 
equally well to the fixed sample procedure—the Sign Test—which is 
the parent of this sequential procedure so let us look at the simpler 
example. Both the Sign Test and its sequential analogue would be 
based on a study plan where one member of each patient pair is randomly 
allocated to treatment A or B (the second member receiving the re- 
maining treatment). When both patients in the pair have the same 
response (both live, both die), the “‘tie’’ is ignored insofar as the sig- 
nificance test is concerned. Thus in the example of the query the test 
would be based on the six pairs where the patient on treatment A 
survived and the patient on B dies (and the zero cases with the opposite 
results). The Mosteller-Corrected Sign Test would be (6—1)?/6=4.17. 
This exceeds 3.84 so the results would be significant at the 5 percent 
level. 

Is it wise to ignore some of the experimental results (eg. the “‘ties”’)? 
Let me make it clear that the “‘ties’’ would not be ignored in other 
phases of the analysis which deal with the clinical importance of the 
difference between treatments. The question, then, is whether “ties” 
should be ignored in dealing with the specific point: Is there any differ- 
ence in the responses to the two treatments? There are two technical 
approaches which will provide an answer to the question. First we 
might ask: Does ignoring the ties impair the efficiency of the procedure? 
The technical answer is that the loss of efficiency is negligible. Second 
we might ask: If significance tests are set up which involve the tied 
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observations, will they be better than the Sign Test? The technical 
answer is that there is no advantage to the alternative procedures and 
sometimes they result in a loss of power. While both technical ap- 
proaches lead to the answer ‘‘ties can be ignored” they may not satisfy 
an investigator who would like a simple rationale rather than a formal 
proof. 

Therefore I will indicate a rationale for the clinical situation. The 
purpose of therapeutic intervention is to change the natural clinical 
course of a disease in a favorable direction. Let us suppose treatment 
B fails to change the natural course (i.e. to death) while treatment A 
occasionally can change this course. Then the number of changes 
achieved on treatment A constitute the evidence of a difference between 
the two treatments. If there are 6 changes, we have the same amount 
of evidence of a difference whether it comes from 6 pairs or 600 pairs. 
If there were some spontaneous remissions (eg. not due to treatment), 
then we would have to compare the number of changes in the two 
series but this information’ would constitute the available evidence of a 
difference between the two treatments (irrespective of the number of 
pairs). When our interest lies in change of status then the ‘‘ties’” are 
irrelevant. 


I. D. J. Bross 
Roswell Park Memorial Institute 
Buffalo, New York, U.S. A. 


107 NOTE: A Simple Method of Fitting the 
Regression Curve y = a + 6x + Bp" 


B. K. San 


Department of Statistics 
University of Baroda, Baroda, India. 


Patterson [1956] described a simple method for fitting an asymptotic 
curve 


y=at Be, ity eels (1) 


for n = 4, 5, 6 and 7 equally spaced ordinates. In this method p is 
estimated by r, the ratio of two linear functions of y’s and a and # can 
be obtained by the linear regression of y on r’. 

This method can be extended to the problem of fitting the curve 
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yaaa 400 + Bp. (2) 


for equally spaced ordinates. Here also the estimate of r is a ratio of 
two linear contrasts of the y’s and the estimates a, 6 and 6 are calculated 
from a multiple linear regression of y on x and 7°. 

As four parameters are to be estimated, the above method gives an 
exact fit form = 4 using the formula 


Ys = 2Yo ce Ya (3 
Yor) 2Y peeve ) 


The following formulae are suggested for n = 5, 6, 7 and 8: 


dys — 4Y3 — TY2 + by 
t= = 4 LOL! Wino; (4 
5y3 — 4y. — (yi, + 6Yo ) 


= ae 400 NS=- 6; 5 
: 10y, — ys — 13y. — Ly, + Loy (6) 


— 10y. + ys — bys — 14ys — 8y2 + 17 ra 
oa 1095 ys — Oy 1A Sy a pera ie (6) 


and 


SLO ya om 5ys — 12y, — lly; — Tys + 20y, 
10ys + 5y; — 5y4 — 12y3 — lly. — Ty, + 20Yo 


, tor n= 8. 7) 


Then four formulae are chosen to give a reasonably high efficiency for 
independent y’s with equal variances. The percentage efficiencies for 
n = 5,6, 7 and 8, calculated from large sample formula for the variances 
of r and the efficient estimate (Shah and Patel, [1960}), are: 


p Ne =D (me © if, SH ai 
0.0 95.2 91.3 88 .4 86.4 
Ome 98.4 97.3 96.0 94.4 
0.2 99.7 99.7 99.0 97.8 
0.3 99.9 99.7 98.4 96.9 
0.4 99 .6 98 .4 95.5 93 .0 
0.5 99.0 96.1 91.4 87.5 
0.6 98.1 93.7 87.1 82.6 
On 97.3 91.4 83.0 onl 
0.8 96.3 89.4 Mal 71.8 
0.9 95.4 86.7 75.9 68.2 


Thus the efficiencies of the proposed estimates are of about the same 
order as the efficiencies of Pattersons estimates of p for equation (1). 
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168 QUERY: On a Follow-up Study of the 


Growth of Children 


In conducting a long term follow-up study of the growth and de- 
velopment of a sample of children, the question arises as to whether 
losses between birth and 4 years were such as to bias the remaining 
sample. Two specific questions may be asked: 

(a) Are losses independent of social class, age of parent etc? Leavers 
and stayers were compared, on several variables, using the Brandt- 
Snedecor method for a 2 X k table. 


Example 
Mother’s age Proportion 
at child’s Initial Stayers | remaining 
birth Sample Leavers (X) (p) px 
up to 24 49 18 dl 6327 19.6137 
25-32 tity 25 92 7863 72.3396 
33+ 55 6 49 8909 43.6541 
Totals 2A. 49 172 p = .7783 > pX = 135.6074 
p > X = 183.8676 
diff. = 1.7398 


> = (> pX — p ) X)/aG = 1.7398/.1725 


x 
I 


10.09 


(01 > P > .001). 


With several parental and social variables, x’ indicated significant 
_ differences between leavers and stayers. 
(b) What is the probability that the remaining sample (stayers) 
could arise by random sampling from the original sample? 
This would involve not a test of independence between staying and 
mothers’ age, but a test of the difference between observed frequencies 
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among stayers, and expected frequencies based on the proportions 
in the original sample. 

Can x? be used to answer this question? It would seem illegitimate, 
in so far as expected values would themselves be derived from sample 
values, and not from population values. 

Douglas and Blomfield (1) employed a x’-test in this way, but in 
their case the initial sample consisted of all children born in a particular 
week in the majority of local authority areas of England and Wales. 
They derived expected frequencies in social class categories from the 
proportions in this sample, and applied them to the stayers. Is this 
legitimate, in so far as their expected frequencies might be said to be 
derived from population values? 

If the x’-technique is inapplicable in my case, is there any other 
way of assessing how unrepresentative the remaining sample has become? 


REFERENCE 


Douglas, J. W. B. and Blomfield, J. M. [1956]. The reliability of longitudinal sur- 
veys. Milbank Mem. Fund Quart. 84, 227-252. 


ANSWER: 


Since the original sample was random, the two questions, “Are 
the proportions of ‘leavers’ and ‘stayers’ effectively the same in the 
different rows?’’, and “‘Can the sample of stayers be regarded as a random 
sample of the original population?” are equivalent. The x’-test answers 
the question “Could the samples in all rows (or all columns) of the table 
have arisen by random sampling from a single population?”’, which is a 
slightly more specific version of the first question above. Since for 
your data x’ is significant, we are led to conclude that there is a genuine 
difference in the proportions of mothers of different ages in the sub- 
populations of ‘all stayers’ and ‘all leavers’ (i.e. of all mothers in the 
population who would have been classified under these headings had 
they been included in the sample): since the original sample was random, 
the sample of stayers must be regarded as having been drawn at random 
from the sub-population of all stayers, and so cannot be regarded as 
randomly drawn from the whole population. No independent test of 
this is necessary, or indeed possible: since the two questions are logically 
equivalent, the same test must be used to answer both. Admittedly 
it appears at first sight that a different test could be derived by com- 
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paring the observed numbers of stayers with their expectations, using 
the statistic 


Tea fee ACO 
NF » Var (XE) ) (1) 


where X, is the number of stayers in the th class, and the expectation 
and variance of X; are calculated on an appropriate null hypothesis. 
This statistic does not appear to depend on the numbers of leavers: 
however, since the true population proportions are here unknown, this 
test can only be based on the null hypothesis that the X, have arisen by 
independent random binomial sampling from the small numbers n; 
of the original sample, with a common probability estimated by 7. 
Hence Var (X,) must be estimated by n;pq, so that formula (1) yields 


ae Ss [X; = pl 

NiPq 
which, after a little manipulation, reduces to precisely the form that 
you have already used to calculate x”. Only if all the values of p are 
very small, so that g can be taken as effectively equal to 1, can formula 
(1) be approximated by the expression 


See tea 
— 25) o 


? 


x 


For your data g = 0.2217, so that the use of formula (2) would be 
illegitimate, as you conjecture, and would give a x’ less than one- 
quarter the size of the correctly calculated value. As far as I am able 
to judge from the information given in their paper, the test used by 
Douglas and Blomfield is in fact invalid for this reason. 

Having once decided that leavers and stayers do differ we may ask, 
quite legitimately, the further question, ‘(Is the non-randomness of the 
stayers, regarded as a sample of the original population, sufficiently 
serious to produce Jarge biases in the estimates to be calculated from 
the sample?” If the overall proportion of leavers were small, the 
disturbances introduced by any variation in rate of loss might well be 
negligible in comparison with the original sampling variation, so that 
the sample of stayers could be treated as though it were a random sample 
from the population: the resultant estimates would be biased, but the 
_ bias would be unimportant. If the true proportions in the various 
classes in the population are known, x” may be calculated from formula 
(2): it must not be regarded as a test statistic, since the appropriate 
null hypothesis has already been rejected, but its size may be used as a 
criterion of the degree of non-randomness of the sample of stayers. 
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If the population proportions are unknown, as in your example, some 
indication of the seriousness of the risks involved in treating the sample 
of stayers as random may be obtained by considering the changes 
produced by leaving in the proportions of the sample falling in the 
various groups. For example, in the results quoted by Douglas and 
Blomfield the largest change in any employment group was in that of 
‘manual workers’, which comprised 0.691 of the original sample, and 
0.696 of the sample of stayers. Clearly, even though the authors 
were in error in maintaining that the sample of stayers could be regarded 
strictly as a random sample from the population, the degree of bias 
introduced by so regarding it was trivial. In your data, on the other 
hand, the proportion of mothers under 25 has fallen from 0.222 in the 
original sample to 0.180 in the sample of stayers, a much more serious 
change. 

Biases resulting merely from the changes in proportions can be 
largely avoided, in theory, by calculating separate estimates for the 
different groups, and combining them according to the proportions of the 
original sample. However, a breakdown of the sample taking into 
account all criteria that show significant changes in proportions may 
well leave you with groups too small for satisfactory estimation, and 
this method cannot allow for the further danger that the leavers will 
almost certainly be non-typical of the population even within groups, 
so that a considerable bias may remain even though the effects of 
disproportionality are successfully removed. 

M. R. SAMPFoRD 


A. R. C. Unit of Statistics, 
Aberdeen, Scotland. 
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DE JONGE, H. (editor). Quantitative Methods in Pharmacology. Amsterdam: 


16 
NordeHolixnd Publishing Company, 1961. Pp. xx + 391. £4.4s. 


D. J. FINNEy, University of Aberdeen, Scotland. 


In May 1960, the Netherlands Region of the Biometric Society organized a 
Symposium for the Society. This volume contains the texts of the 26 papers pre- 
sented, together with summaries of discussions and of remarks by chairmen. The 
papers differ greatly in interest and importance, and any selection for comment 
here may reflect the taste of the reviewer as much as the quality and balance of the 
Symposium. 

The first section of the book contains papers on sequential trials of drugs. 
HAJNAL presents an excellent account of sequential tests of four analgesics against 
a control, where assessment is in terms of a continuous variate and estimates of 
effects are wanted. JOHNSON introduces an important subject in a short paper on 
how to choose a sequential procedure that will meet practical requirements. RUMKE 
suggests uses of sequential techniques for estimating doses to be used in a definitive 
experiment; his idea is incompletely worked out, and questions of efficiency are not 
discussed, but it may give rise to improvements in the strategy of using pilot experi- 
ments. 

The second section, on drug standardization, includes a theoretical paper by 
CLARINGBOLD and EMMENS, consisting of a formal account of computational 
procedures appropriate to multivariate quantal responses under a general trans- 
formation, of which the normal equivalent deviate is a familiar example. Numerical 
illustration might have been helpful. Additional evidence is given that many types 
of data are insensitive to the choice of transformation, and the angle transform is 
therefore commended for general use. VAN STRIK’s suggestion of using rank as a 
metameter for semi-quantitative responses may sometimes be helpful, but the 
method fails completely in two special cases and one may suspect that other rank 
orders close to these extremes will give trouble. 

Evidently the most controversial session of the Symposium was that on non- 
parametric methods. HEMELRIJK and VAN DER VAART contribute papers 
on the efficiency and robustness of Wilcoxon’s test; both are of considerable theoreti- 
cal interest, though neither recognizes that the interpretation of numerical obser- 
vations can seldom be completed solely by appeal to tests of significance. VAN 
EEDEN’s ingenious but misleadingly titled paper illustrates this well; her method 
of estimating some points of the distribution function for a tolerance distribution, 
without assumption of any specified functional form, can evidently be extended to 
provide a method of estimating relative potency from two series of records which 
will work for some data, though for others it may be ambiguous. At best, it must 
be indeterminate to the extent of unit interval on the dose scale, and the author’s 
hopes of finding a method of assigning a measure of precision to such an estimate 
seems at present to lack foundation. JUVANCZ’s entertainingly vigorous attack on 
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non-parametric methods perhaps relies too much on what users of statistics have 
done rather than on what they ought to do, but comes closer to the needs of scientific 
research. In “‘A Visual Approach to Bio-Assay,”’ GOLDBERG presents an ingenious 
system of graphs and scales that enable a close approximation to the maximum likeli- 
hood estimation of parameters for a quantal response regression to be achieved by a 
series of additions. The account of this, even for the single parameter exponential 
model, is so complicated as to make this reviewer prefer the simplicity and speed of 
standard maximum likelihood calculations. 

The sections on drug screening and on mixtures of drugs are of consistently high 
standard. DUNNETT and SCHNEIDERMAN each discuss problems of evaluat- 
ing alternative screening rules with emphasis on economic aspects. Dunnett con- 
siders carefully the components of cost and loss that must enter into the specification 
of an optimal screening programme for a large number of drugs by appropriate 
decision rules; Schneiderman concerns himself more with the operating character- 
istics of his sequential procedures for screening anticancer drugs, and includes inter- 
esting comments on the use of a well-instructed computer. BROCK and SCHNEI 
DER seek to rehabilitate the therapeutic index by more careful definition than is 
customary; consideration of optimal experimental planning for the estimation of 
their index might throw more light on the seriousness of the loss in precision inherent 
in working at 5% and 95% response rates. Alternative, but not necessarily con- 
flicting, views on quantal responses to mixtures of drugs are discussed by ARIENS 
and SIMONIS and by HEWLETT and PLACKETT. The comprehensive mathe- 
matical models of the second will have the greater appeal for the statistician; for the 
pharmacologist, this summary of metrical aspects of Arién’s receptor theory should 
prove valuable, and perhaps the conjunction of the two will stimulate an effective 
synthesis. DIKSTEIN’s short account of a hypothesis relating chain length of 
primary and monoquaternary amines to excitation of guinea-pig muscle must surely 
be the forerunner of much work on the relation of physicochemical properties and 
pharmacological action. DE JONGH gives a useful and simple summary of the 
isobole method of representing different types of action of drugs in mixture. 

Inevitably a symposium of this kind includes a group of papers that defy any 
classification other than ‘‘miscellaneous”’, but those in the present volume are not 
the less interesting for that. GURLAND’s suggestions on bioassays for estimation 
of minute insecticidal residues include useful practical ideas for taking account of 
the masking of potency by inactive materials present in plant extracts and for the 
“fortification” of test extracts of very low potency so as to permit adequate assays 
when available quantities are severely limited. Finally, a contribution from 
BILLEWICZ describes a statistical study of a technique of measurement (the 
falling drop technique for estimating the concentration of deuterium oxide): al- 
though in itself purely physical, this is a good example of the manner in which a 
statistician can help to improve the accuracy of a laboratory technique. 


7 Le ROY, H. L. Statistische Methoden der Populationsgenetik. Stuttgart: 
Birkhauser Verlag, 1960, pp. 397, 67.5 Swiss Francs. 
C. Harts, Universitdt zi Kéln, Germany. 


This book contains an extensive description of statistical methods used in 
modern population genetics. The reader is assumed to have some knowledge of 


BOOK REVIEWS 659 


genetics and general biometrical methods, especially correlation methods and the 
analysis of variance. The first chapter deals with the influence of environment, 
genotype, and the combination of both on the expression of the phenotype. First, 
statistical and biometrical models are constructed. Then statistical methods 
are explained, and the use of the formulae demonstrated by numerical examples. 
The cases discussed are simple allelism, multiple allelism and polygenic inheritance. 
On this basis the next chapter gives the statistical treatment of the interdependence 
of genetical and environmental factors. Here the method of path coefficients is 
introduced, and by this method the analysis of correlations and interactions for 
different degrees of relationship is explained. 

The third chapter deals with the same problems, but here they are analysed 
by means of the analysis of variance for the cases of a hierarchical and a two-way 
classification (incomplete and complete analysis of variance). The author gives both 
the method of analysis and the interpretation of the analytical results. The methods 
apply mainly to animal breeding, but in addition the results are generalised by dis- 
cussing the special problems which arise in plant breeding and human genetics. 

In the fourth chapter the methods given in Chapters 2 and 3 are applied to the 
special case of artificial selection in respect of one or more characteristics. Here not 
only the classical designs but also modern methods are considered. The author com- 
pares regression analysis, the method of path coefficients and the analysis of vari- 
ance. Different methods of treatment are also compared with respect to their effi- 
ciency and the amount of information they can give. 

In all chapters the formulae which are used are derived in a valid manner and 
the methods of analysis are illustrated by good examples. In several places there 
are diagrams which help to make the complex connections quite clear. The methods 
of analysis are clearly illustrated by detailed numerical examples. At the end of the 
book there is an extensive list of literature, which not only contains the papers 
mentioned in the text, but is in fact a bibliography on all the questions raised in the 
course of the discussions. In this way it is easy for the reader to find references in 
his own field of work on special questions which are not given a place in the book. 
The last pages give, besides the index and a list of authors, three very instructive 
tables. Their two-way classification according to problems and methods makes it 
possible to see the connections between the chapters and to locate the questions 
which are discussed under different topics from different points of view. 

The reader who knows enough in advance to follow the author and who has 
enough energy to work through nearlv 400 pages, will benefit greatly from this book. 


LI, C. C. Numbers from Experiments.—A basic Analysis of Variation. Pitts- 


Bs burgh: The Boxwood Press 1959. pp. 1x + 106, Cloth $4.00, paper $2.75. 


G. H. Jowett, University of Melbourne, Australia. 


This book is mainly concerned with explaining the algebra involved in the 
analysis of variance applied to one-way, hierarchical and two-way classifications, 
and to linear regression. 

The author is clearly fascinated by the partitioning of sums of squares and goes 
to considerable trouble to show how this happens. He makes considerable use of an 
interesting diagrammatic representation of a comparison between sums of squares 
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as a comparison between areas made up of actual geometrical squares, which is 
apparently his own invention. 

Summation and suffix notations are used freely, particularly in later chapters, 
and matrices and orthogonal contrasts are also used, but only to demonstrate the 
splitting up of a sum of squares in an alternative manner. No basic statistical theory 
is given, nor is any serious attempt made to explain other aspects of the mathe- 
matical statistics. Some space is devoted to establishing expectations of mean 
squares, but this is based on quoted theorems of the type, 


Ky + Ye ete +e) = E(y) + E(ye) ae ceay 
E(yy2) = # (y, , y2 independent), 


the plausibility of which is justified mainly by reference to a very special case where 
y can take each value with equal probability. An attempt is also made to demon- 
strate the concept of a linear model; an ingenious example—“‘Grandma’s birthday 
party’’—is used for this purpose, deviations from observed means being interpreted 
as gains and losses in a poker game by the grandchildren, the money being provided 
by Grandma with some interference by Grandpa. Illustrative examples used in the 
exposition are deliberately artificial, being chosen to employ simple numbers and 
thus concentrate attention on relationships and procedure rather than to suggest 
applications. 

Since the book does not cover the meaning and application of the techniques, 
it gives little help to anyone seeking to design or interpret the analysis of an experi- 
ment. On the other hand, it is not a suitable introduction to analysis of variance 
for a student of mathematical statistics or statistical method; it is loose and evasive 
about such important matters as degrees of freedom, testing hypotheses and inde- 
pendence, brings in sophisticated and inadequately explained jargon, and makes 
vague and unhelpful references to advanced mathematical topics (e.g. the rank of 
quadratic forms) in cases where the author has not been able to think of simple ex- 
planations. However, there is material here and there which, selected with dis- 
crimination, could give a teacher ideas for presentation or just tilt the balance of 
understanding for a student having difficulty with the algebra of the technique. 


THE BIOMETRIC SOCIETY 


MERTINGS OF E. N. A. R. 


Annual Meeting, 1961 


The annual meeting of the Eastern North American Region of the Biometrics 
Society is being held during the meetings of the American Statistical Association 
at the Roosevelt Hotel in New York City, December 27-30, 1961. Jointly with 
the Biometrics Section of A. S. A., ten sessions of invited papers and one session 
of contributed papers have been arranged. 


Spring Meeting, 1962 

E. N. A. R. will meet jointly with the Institute of Mathematical Statistics at 
the University of North Carolina, Chapel, Hill, April 12-14, 1962. M. E. Turner, 
Division of Biometry, Medical College of Viginia, and A. F. Bartholomay, Biophysics 
Laboratory, Harvard Medical School, Cambridge, are co-chairmen of the E. N. A. R. 
program committee. H. B. Wells, Department of Biostatistics, University of 
North Carolina, Chapel Hill is handling local arrangements for E. N. A. R. 

Titles for contributed papers should be sent to M. E. Turner, who will then 
supply authors with abstract forms and appropriate instructions. 


Annual Meeting, 1962 


The annual meeting of E. N. A. R. in 1962 will be held during the meetings of 
the American Statistical Association at the Hotel Leamington in Minneapolis, 
September 7-10, 1962. Joint sessions with the Biometrics Section of A. S. A. are 
being arranged. 


W.N.A. R. 


MINUTES OF THE 1961 WNAR BUSINESS MEETING 


The annual meeting of WNAR, Biometrics Society was held at 2:30 P. M. 
June 16, 1961 at the University of Washington, Seattle. William F. Taylor, Presi- 
dent, WNAR, presided. 

The Secretary’s and Treasurer’s reports were read and approved. The WNAR 
student award (consisting of a free year’s subscription to Biometrics) was discussed 
at length and a motion was passed providing that any graduate student with a dual 
interest in the biological sciences and statistics should be considered eligible for the 
award. 

A possible meeting with the AAAS at Denver December 26-31 was discussed 
and the general concensus of opinion was that we should participate in this meeting. 

A motion was passed unanimously that WNAR meet for our annual meeting 
with AIBS at Oregon State University, Corvallis during the last week of August, 1962. 

The nominating committee gave their report and nominations were made from 
the floor. Those nominated were: for President, Carl A. Bennett, for the two 
Regional Committee vacancies, Gordon E. Dickerson, Charles Mode, Henry Tucker 


and Alvin Wiggins. 
The meeting adjourned at 3:20 P.M. Present at the meeting: Becker, Bennett, 
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G., Bennett, C., Berry, Bohidar, Brown, Calvin, Chapman, Hopkins, Leitch, 
Mantel, Nash, Petersen, Puri, Russell, Sandomire, Taylor, Tucker, Vaughan, 
Wiggins. 


WNAR ELECTION RESULTS 


Carl A. Bennett was elected WNAR President for 1962-63 and Charles J. Mode 
and Henry Tucker were elected to the Regional Committee for 1962-64. 


JOINT MEETING OF WNAR AND AAAS 


The Western North American Region will meet with the American Association 
for the Advancement of Science in Denver on December 28, 1961. There will be 
three sessions entitled “Experimental Design,” ‘‘Estimation of Populations (mobile 
and others), and a “Contributed Papers Session.” Members wishing to present 
papers should contact Dr. Franklin A. Graybill, Statistical Laboratory, Colorado 
State University, Fort Collins, Colorado, Program Chairman. 


CHANGES OF MEMBERSHIP 
(July 15-October 15, 1961) 


Changes of Address 

Dr. Daniel J. Baer, 519 Chaucer Road, Dayton, Ohio, U.S.A. 

Mr. C. G. Barraclough, Chemistry Department, University of Melbourne, Parke- 
ville N. 2, Victoria, Australia 

Dr. Austin W. Berkeley, Department of Psychology, Boston University, 700 Com- 
monwealth, Boston 15, Massachusetts, U.S.A. 

Mr. Paul V. Blair, Institute of Enzyme Research, 1702 University Avenue, Univer- 
sity of Wisconsin, Madison 5, Wisconsin, U.S.A. 

Mr. Andre Bodeaux, Faculte Agronomique, Usumbura, Ruanda-Urundi, Belgian 
Congo 

Mr. Neeti R. Bohidar, Statistical Laboratory, Utah State University, Logan, Utah, 
U.S.A. 

Mr. Philippe F. Bourdeau, 370 Prospect Street, New Haven 11, Connecticut, U.S.A. 

Mr. N. Charliers, 199 Bonneville Sclayn, Province de Namur. Belgium 

Dr. J. B. Chassan, Hoffman-LaRoche, Nutley 10, New Jersey, U.S.A. 

Mr. Pan Leang Cheav, 1 rue Lucien Petit, Gembloux, Belgium 

Mr. William P. Chu, 5824 Fairfax Avenue, South Minneapolis, Minnesota, U.S.A. 

Dr. Ellsworth B. Cook, American Society Pharmacology and Experimental Thera- 
peutics, 9650 Wisconsin Avenue, NW, Washington 14, D. C., U.S.A. 

Mr. Tiberius Cunia, 2835 McCarthy Street, City of St. Laurent, Quebec, Canada 

Mr. R. N. Curnow, Unit of Biometry, The University, Reading, Berkshire, England 

Mr. Pierre T. A. Dagnelie, 53 Chaussee de Charleroi, Gembloux, Belgium 

Mr. Marc Dalebroux, 74 Planoy, Biesme-lez, Belgium 

Mr. R. de Coene, 79 rue Lincoln, Bruxelles 18, Belgium 

Prof. Martin E. Dehousse, 88 avenue Maeterlinck, Brussels 3, Belgium 

Dr. B. Diamantis, 2129 Scudder Street, St. Paul 8, Minnesota, U.S.A. 

Mr. H. M. Dicks, South Africa Sugar Association, Experiment Station, P. O. 
Mt. Edgecombe, Natal, South Africa 

Miss Martha W. Dicks, Division of Home Economics, Box 3354. University Station, 
Laramie, Wyoming, U.S.A. 
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Mr. Victor A. Dirks, 12800 Dupont Avenue, 8, Savage, Minnesota, U.S.A. 

Dr. N. R. Draper, Department of Statistics, University of Wisconsin, Madison, 
Wisconsin, U.S.A. 

Mr. Bruce A. Drew, The Pillsbury Company, 311 Second Street, S.E., Minne- 
apolis 14, Minnesota, U.S.A. 

Mr. L. Lee Eberhardt, Box 562, Concord, California, U.S.A. 

Mr. Harvey Eisenberg, 3722 Cedar Drive, Lochearn, Baltimore 7, Maryland, 
U.S.A. 

Dr. Khalil M. El-Kashlan, 16 Green Street, Moharrem Bey, Alexandria, Egypt 

Mr. Roger Firmin, 10 avenue R. Neybergh, Bruxelles, Belgium 

Mr. Andris Fogelmanis, 914 Sunset Drive, Pacific Grove, California, U.S.A. 

Mr. J. Fraselle, rue des Bas-Pres, Salzinnes (Namur), Belgium 

Dr. Seymour Geisser, Section on Biometry, Nat. Inst. of Arthritis and Metabolic 
Diseases, Bethesda, Maryland, U.S.A. 

Dr. Lue Goeminne, 6 Bagattenstraat, Gand, Belgium 

Mr. Richard A. Greenberg, 430 Central Avenue, New Haven, Connecticut, U.S.A. 

Mr. Samuel W. Greenhouse, Section on Statistics and Mathematics, National 
Institute of Mental Health, Bethesda. Maryland, U.S.A. 

Mrs. Fredrica G. Halligan, Warwick Towers, 9 Lafayette Court, Greenwich, Con- 
necticut, U.S.A. 

Dr. Emil L. Gumbel, Columbia University, 413 West 117 Street, New York 21, 
INGLE SUES TAS 

Prof. John Gurland, U.S. Army Math. Research Center, University of Wisconsin, 
Madison, Wisconsin, U.S.A. 

Mr. M. A. Guzman, 10 Dixie Trail, Raleigh, North Carolina, U.S.A. 

Mr. A. M. Haider, Federation of Industries, Ottoman Bank Building, Baghdad, 
Traq 

Dr. W. D. Hanson. Department of Genetics, North Carolina State College, Raleigh, 
North Carolina, U.S.A. 

Mr. Jacques Hardouin, 35 Via San Girolama, Perugia, Italy 

Mr. J. A. G. Hemptinne, 29 rue A. de Wasseige, Wepion, Fooz, Belgium 

Professor Dr. Leon Hennaux, Ville Beau Sejour, Coril Noirmont (Brabant) Belgium 

Mr. Jean Henry, 3 avenue Copyn Malaise, La Hulpe, Belgium 

Mr. John R. Howell, Route 5, Box 364, Orlando, Florida, U.S.A. 

Dr. Pierre O. Hubinont, 25 avenue Ad. Demeur, Bruxelles 6, Belgium 

Mr. Paul V. Hurt, 4721 Ames Street, Madison 5, Wisconsin, U.S.A. 

Mr. Floribert A. L. G. Jurion, 8 avenue Isidore Gerard, Bruxelles 16, Belgium 

Dr. Richard A, Lang, Department of Statistics, P. O. Box 5457, Raleigh, North 
Carolina, U.S.A. 

Dr. Lonnie L. Lasman, 124 Claridge, Eau Gallie 7, Florida, U.S.A. 

Dr. Jean Lebrun, 12 av. des Lucioles, Brussels, Belgium 

Mr. G. H. Lion, 61 rue Gabrielle, Bruxelles 18, Belgium 

Mr. Donald G. MacEachern, 1717 Brook Avenue, S.E., Minneapolis 14, Minnesota, 

U.S.A. 

Mr. John W. Mayne, 654 Sherbourne Road, Ottawa 3, Ontario, Canada. 

Mr. James H. Meade, Jr., Institute of Statistics, P. O. Box 5457, Raleigh, North 

Carolina, U.S.A. 

Mr. Hyman Menduke, 7808 Haines Road, Cheltenham, Pennsylvania, U.S.A. 

Mr. Forest L. Miller, 111 Villanova Road, Oak Ridge, Tennessee, U.S.A. 

Dr. Sigeiti Moriguti, Faculty of Engineering, University of Tokyo, Bunkyo-ku, 


Tokyo, Japan 
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Mr. Floyd R. Olive, McKamie, Arkansas. U.S.A. 

Mr. James G. Osborne, Old Chapel Hill Road, Route 1, Box 97, Durham, North 
Carolina, U.S.A. 

Mr. Jyun Otsuka, Kaname-machi 14, Toshimaku, Tokyo, Japan 

Miss Florence E. Petzel, Department of Home Economics, University of Texas, 
Austin, Texas, U.S.A. 

Mr. Andre Pieters, 39 Eeklastraat, Mariakerke (Gent) Belgium 

Mr. Jean Reuse, Clos des 4 Vents, Rue de Moorsel, (Brabant) Tervueren, Belgium 

Mr. Julien F. M. Ronchaine, 70 Grand Rue, Gembloux, Belgium 

Mr. Charles E. Rossiter, Pneumoconiosis Research Unit, Llandough Hospital, 
Penarth, Glamorgan, Wales 

Dr. Francesco Sella, Radiation Committee, Room 3468, United Nations, New 
Mopale, ING Na, Wise 

Mr. A. B. Siegelaub, 721 Walton Avenue, New York 21, N. Y., U.S.A. 

Mr. William A. Small, P. O. Box 122-A, Tennessee Polytechnic Institute, Cooke- 
ville. Tennessee, U.S.A. 

Dr. H. Fairfield Smith, FAO-UN, P. O. Box 1555, Teheran, Iran 

Dr. John H. Smith, Department of Mathematics and Statistics, American Uni- 
versity, Washington 16, D.C., U.S.A. 

Mrs. Grace Scholz Spitz, 803 Marlo Drive, Falls Church, Virginia, U.S.A. 

Mr. Francois Sterckx, Aartcentrum 66, Geel, Belgium 

Dr. Howard L. Stier, United Fruit Company, 30 St. James Avenue, Boston 16, 
Massachusetts, U.S.A. 

Dr. Robert J. Taylor, CCNCS-NCI, National Institutes of Health, Bethesda, 
Maryland, U.S.A. 

Prof. Robert S. Temple, College of Agriculture, University of Tennessee, Knox- 
ville, Tennessee, U.S.A. 

Mr. George W. Thomson, 15093 Faust Avenue, Detroit 23, Michigan, U.S.A. 

Mr. M. Ulehla, 8 Carlysle Street, E. Hawthorn, Victoria, Australia 

Prof. A. van den Hende. 233 Coupure Links, Gand, Belgium 

Miss Cecile Van Kerchove, 13 rue Archimede, Brussels, Belgium 

Mrs. Pearl A. Van Natta, 1303 South Eudora, Denver 22, Colorado, U.S.A. 

Dr. B. Veen, van Tienhovenlaan 13, Velp, Netherlands 

Mr. Glen F. Vogel, 6600 Luzon Avenue, NW, Washington 12, DACs ULSvAr 

Prof. Maurice Welsch, 99 Rue General Jacques, Embourg, Belgium 

Dr. R. Wette, M. D. Anderson Hospital, 6723 Bertner Avenue, Houston 25, Texas, 
U.S.A. 

Mr. Robert F. White, Science Information Lab., Smith, Kline and French, Phila- 
delphia, Pennsylvania, U.S.A. 

Dr. Manfred Woelke, Killicklaan 689, Les Marais, Pretoria, South Africa 


New Members 


At Large 


Mr. A. M. Haider, Federation of Industries, Ottoman Bank Building, Baghdad, 
Iraq 

Dr. Helmut V. Muhsam, Professor of Statistics, Hebrew University, Jerusalem, 
Israel 


Australia 


Mr. P. J. Brockwell, Department of Statistics, University of Melbourne, Victoria, 
Australia 
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Mr. Peter D. Finch, Department of Statistics, University of Melbourne, Victoria, 
Australia 

Miss Betty Laby, Department of Statistics, University of Melbourne, Victoria, 
Australia 

Mr. J. S. Maritz, Department of Statistics, University of Melbourne, Victoria, 
Australia 

Miss Ize Raudzins, 36 Boulder Avenue, Redcliffe, W. Australia 

Mr. K. V. Richardson, Department of Statistics, University of Melbourne, Victoria, 
Australia 


ENAR 


Mr. Charles Anello, 5905 Montgomery Street, Baltimore 7, Maryland, U.S.A. 

Mr. Charles E. Antle, 1620 University Avenue, Stillwater, Oklahoma, U.S.A. 

Mr. Thomas P. Bogyo, Department of Experimental Statistics, North Carolina 
State College, Raleigh, North Carolina, U.S.A. 

Mr. DuWayne C. Englert, Population Genetics Institute, Purdue University, 
Lafayette, Indiana, U.S.A. 

Prof. George A. Ferguson, Department of Psychology. McGill University, Montreal, 
Canada 

Mr. R. E. Foster, Forest Pathology Investigations, Department of Forestry, Vic- 
toria, B. C., Canada 

Mrs. Iris M. Kiem, Department of Medicine, University of Miami, Miami 36, 
Florida, U.S.A. 

Miss Joan Klebba, 1469 South 28th Street, Arlington 6, Virginia, U.S.A. 

Mr. James L. Pate, Department of Psychology, University of Alabama, University, 
Alabama, U.S.A. 

Dr. Norman B. Rushforth, Developmental Biology Center, Western Reserve 
University, Cleveland, Ohio, U.S.A. 

Mr. Jack Sebring, 1515 Ogden Street, NW, Washington 10, D. C., U.S.A. 


Germany 
Dr. O. Dittmar, Inst. f. Forstwiss, Alfred-Moeller-Str., Eberswalde, Germany 


Sweden 
Prof. Dr. E. Effenberger, (Hamburg 13) Gartenstrasse 2, Alsterchaussee, Erfurt, 
Sweden 


Switzerland 
Dr. Klaus G. Konig, Zahnarztliches Institut der Universitat, Zurich 28, Postfach, 


Switzerland 

Dr. Thomas M. Marthaler, Dental Institute, University of Zurich, Postfach, 
Zurich 28, Switzerland 

Dr. Heinrich Wagner, Ungargasse 32, Wien III, Austria 

Dr. K. Wuhrmann, Hofstr. 139, Zurich 44, Switzerland 


WNAR 


Dr. G. Eric Bradford, Department of Animal Husbandry, University of California, 
Davis, California, U.S.A. 

Mr. Howard F. Gingerich, 1817144 Berkeley Way, Berkeley 3, California, U.S.A. 

Mr. Melville R. Klauber, Bldg. 315, Stanford Village, Stanford, California, U.S.A. 
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Dr. Orsell M. Meredith, Nuclear Medicine and Radiation Biology, UCLA Medical 
School, Los Angeles 24, California, U.S.A. 

Mr. William L. LeStourgeon, Statistics Group, Los Alamos Scientific Labs., Los 
Alamos, New Mexico, U.S.A. 

Dr. Daniel G. Siegel, Atomic Bomb Casualty Commission, Navy 3912, FPO, 
San Francisco, California, U.S.A. 

Dr. Ervin P. Smith, Animal Science Department, Montana State College, Bozeman, 
Montana, U.S.A. 

Mr. N. Scott Urquhart, Computing Center CSURF, Colorado State University 
Fort Collins, Colorado, U.S.A. 


NEWS AND ANNOUNCEMENTS 


Members are invited to transmit to their National or Regional Secretary (if mem- 
bers at large, to the General Secretary) news of appointments, distinctions, or retirements, 
and announcements of professional interest. 


NEWS ABOUT MEMBERS 


Jacob B. Chassan has been appointed statistician for the Research Department 
of Hoffmann-La Roche, Inc., at Nutley, N. J. 

C. Philip Cox, formerly of the N.I.R.D., Shinfield, England, has joined the staff 
of the Statistical Laboratory and Department of Statistics, Iowa State University, 
as Associate Professor. Professor Cox will be teaching and conducting research 
in the area of biological statistics. 

Seymour Geisser has been recently appointed Chief of Biometry of the National 
Institute of Arthritis and Metabolic Diseases. 

Dewey L. Harris, has been appointed as Assistant Professor to the staff of the 
Statistical Laboratory and Department of Statistics, Iowa State University. Dr. 
Harris will be consulting, teaching and doing research in the area of Genetics 
Statistics. 

Donald H. Hazelwood, formerly NIH Fellow, Department of Zoology, Washing- 
ton State University has taken a position as Assistant Professor of Zoology, Uni- 
versity of Missouri. 

Donald K. Hotchkiss, has been appointed as Assistant Professor to the staff 
of the Statistical Laboratory and Department of Statistics, Iowa State University. 
Dr. Hotchkiss will be consulting, teaching, and doing research in the area of statistics 
applied to animal nutrition. 

Robert Macey is now Assistant Professor, Department of Physiology, Uni- 
versity of California, Berkeley, moving from the University of [Illinois School of 
Medicine. 

J. N. K. Rao of the Statistical Laboratory, and Department of Statistics, lowa 
State University, has been promoted to rank of Assistant Professor. Dr. Rao will 
be teaching and conducting research in survey sampling in his new assignment. 

Elmer E. Remmenga, Chief, Computing Center, Colorado State University, 
will be on sabbatical leave from July, 1961 to July, 1962. 

Patrick K. Tomlinson has recently become an Aquatic Biologist, Ocean Salmon 
Investigation, California Department of Fish and Game. He formerly was in the 
Biostatistics Section. 

Richard W. Vail, Jr. has taken a position as statistician in the Aerospace Cor- 
poration, Florida. He previously was statistician for Aerojet General Corporation, 
Azusa, California. 

Pearl A. Van Natta, formerly Biometrician at the Child Research Council, 
Denver, Colorado, is a self employed statistical consultant. 


SOUTHERN METHODIST UNIVERSITY—NEW PROGRAM IN 
STATISTICS 


Southern Methodist University announces the opening of a new Department 
of Mathematical and Experimental Statistics and Statistical Laboratory in the 
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Graduate School. Faculty members include Drs. Paul D. Minton and Vanamamalai 
Seshadri, assisted part-time by Mr. Del West and Mr. Mack Usher. 

Coursework leading to the degree of Master of Science will be offered, including 
courses in mathematical statistics, experimental statistics, probability, sampling 
theory and design of experiments. 

Departmental assistantships are available. 

For further information, write to Dr. Paul D. Minton, Department of Mathe- 
matical and Experimental Statistics, Southern Methodist University, Dallas 22, 
Texas. 


FLORIDA STATE UNIVERSITY—EXPANDED PROGRAM IN STAT ISTICS 


Beginning in September 1962, the Department of Statistics at the Florida 
State University will expand its graduate program to study and research leading 
to the Doctor of Philosophy degree in statistics. The success of the present program, 
currently limited to study to the Master of Science degree in statistics, bas led to 
the approval of the new advanced training. The curriculum will be modified and 
expanded to include advanced work in statistical inference and decision theory, 
stochastic processes, advanced probability, multivariate analysis, operations re- 
search, special topics in biometry, theory of general linear hypotheses, non-parametric 
statistics and sequential analysis. Advanced courses will be offered in alternate 
years. The program leading to the Master of Science degree will be modified to 
permit both the thesis and non-thesis types of programs. 

The new program will be assisted through the recent addition to the staff of 
Dr. Vincent Hodgson from the London School of Economics and Political Science. 
Dr. Hodgson joints Drs. Ralph A. Bradley, Frank Wilcoxon, Richard G. Cornell 
and §. K. Katti in the Department of Statistics. Additional faculty appointments 
within the next two years are anticipated. 

Facilities have been greatly improved through completion of a new mathe- 
matical sciences building in which instructional, laboratory and office space was 
designed for use of the faculty and graduate students in statistics. The new building 
also houses an IBM 709 computer for research use. 

The Department has a limited number of teaching and research assistantships 
available. Proposals for three-year graduate fellowships are pending and such 
fellowships should be available for graduate students entering the new program in 
September. Interested students are invited to write to Dr. R. A. Bradley, Depart- 
ment of Statistics, the Florida State University, Tallahassee, Florida for further 
information. 
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CORRECTIONS 


Chin Long Chiang [1960]. A Stochastic Study of the Life Table and Its Appli- 
cations: I. Probability Distributions of the Biometric Functions. Biometrics 
16, 618-35. 


In formula (2), page 621, for 1, read l,. In formula (9), page 622, n should be 
deleted. 


Chin Long Chiang [1961]. A Stochastic Study of the Life Table and Its Appli- 
cations: IIT. The Follow-up Study with the Consideration of Competing Risks. 
Biometrics 17, 57-78. 


In line 1, page 62, m should be replaced by n. In formula (21), page 66, for 
S?, read Si, . In formula (32), page 69, for Qez 2 read Qzr12 . In formulas (57) 
and (58), page 73, the super 2 should be deleted from o?. In formula (58), page 73, 
for Azz read Axz. In column (2) of table 4, page 77, for 835.87 read 935.87. In line 5 
from the bottom, page 77, for Q,, read Qrk - 


J. K. Abraham [1960]. 154 Note: On an Alternative Method of Computing Tukey’s 
Statistic for the Latin Square Model. Biometrics 16, 686-691. 


In Table 3, en* should multiply each >°*, 0‘ and >°* appearing, and in Table 4, 
beneath ‘‘Treatments”’, >>* should be similarly corrected. 


BOOKS OF IMPORTANCE IN YOUR FIELD 


Ready in January... 
FIELD PLOT TECHNIQUE 


Completely revised, this edition is both a student text and a researcher’s ref- 
erence manual. Practical aspects of statistical procedures, detailed examples 
of analysis are included . . . the book brings together and explains simply as 
possible the general principles, techniques, assumptions and guides to pro- 
cedures applicable to experimentation in agriculture and biology. 


by Erwin L. LeClerg, United States Department of Agriculture, 
Warren H. Leonard and Andrew G. Clark, Colorado State University, 
January 1962, about 350 pages, price open. 


An Introduction to the 
EXPERIMENTAL METHOD 


This book directs special attention to the design of experiments and the evalua- 
tion of variation by the less complicated statistical methods. The few technical 
terms which are used are fully explained. Mathematical manipulations are at a 
level of arithmetic and secondary school algebra. Both the reader who wants 
to understand research reports, and the investigator who will go on to more 
complex treatises will find the book a valuable aid. 

by James Maxwell Little, Bowman Gray School of Medicine, Wake 
Forest College, 1961, 88 pages, $3.00. 


order from 


BURGESS PUBLISHING COMPANY 


426 South Sixth Street * Minneapolis 15, Minn. 
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Tome I, N° 3-4 Juillet-Octobre, 1960 


SS Eee 


Editorial 


Utilisation de calculateurs électroniques dans l’analyse d’expériences avec 
répétitions et de groupes d’expériences de méme plan expérimental . F. YATES 


Planification des expériences d’alimentation des vaches laitiéres. . . H. L. Lucas 
La sélection artificielle suivant des caractéres quantitatifs..... H. L. Le Roy 


La répartition des sexes chez des cucurbitacées hybrides et 1 leur relation 
avec une distribution contagieuse de Neyman....---------+--- F. WEILING 


La puissance du critére F dans l’analyse de la variance de plans en bloes au 
hasard, nomogrammes pour le choix du nombre de répétitions. ...M. Knuts 


Développement des méthodes biométriques et statistiques dans la recherche 
agronomique au Congo Belge et au Ruanda-Urundi.......... J. M. Henry 


Développement des méthodes biométriques et statistiques dans la recherche 
agronomique en Belgique........--+++2+seeseeree sree settee L. Martin 


L’emploi des méthodes biométriques dans l’étude et l’expérimentation des 
variétés de betterave sucriére en Belgique........ M. Srmwon Et N. ROUSSEL 


Les méthodes d’échantillonnage en sylviculture..........--- ANNE LENGER 


Radiosensibilité des graines d’andropogon issues de terrains uraniféres et non- 
uraniféres du Katanga......DR. J. MEwISSEN, J. Damston, ET Z. M. Baca 


Dix années d’activités de la Biometric Society en Belgique, au Congo Belge 
et au Ruanda-Urundi 


Revue bibliographique. 


oa 


Tome II, N° 1 Mars, 1961 


Quelques problémes de numérations bacterienmes's. amas rian H. Grimm 


Comparaison de deux méthodes d’analyse de données non orthogonales dans 
le cas d’expériences & deux facteurs...... P. GILBERT ET B. GROSSMANN 


Analyse de données non orthogonales dans le cas d’une expériences 4 deux 
PETG Ts hs ee PRS eR ts OO RICIDI RECO U0 0.7 N.C. GEIDK 20 9700 A. Rortt 


Revue bibliographique: D. J. Finney: Experimental Design and its Statistical 
Perit een ee eR RES AS GEO DmOtiraaoamand Golr 8 t,G0c% L. Martin 
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